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Abstract 

Let {Xn)n>o be an irreducible, aperiodic, homogeneous Markov chain, 
with state space an ordered finite alphabet of size m. Using combi- 
natorial constructions and weak invariance principles, we obtain the 
limiting shape of the associated Young tableau as a multidimensional 
Brownian functional. Since the length of the top row of the Young 
tableau is also the length of the longest (weakly) increasing subse- 
quence of (^fc)i<fc<ni the corresponding limiting law follows. We re- 
late our results to a conjecture of Kuperberg by showing that, under a 
cyclic condition, a spectral characterization of the Markov transition 
matrix delineates precisely when the limiting shape is the spectrum 
of the traceless GUE. For m = 3, all cyclic Markov chains have such 
a limiting shape, a fact previously known for m = 2. However, this is 
no longer true for m > 4. 
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1 Introduction 



The identification of tlie limiting distribution of LJ„, the length of the longest 
increasing subsequence of a random word of length n, whose letters are iid and 
chosen uniformly from an ordered, m-letter alphabet, was first made by Tracy 
and Widom [28j. They showed that the limiting distribution of L/^, properly 
centered and normalized, is that of the largest eigenvalue of the traceless m x 
m Gaussian unitary ensemble (GUE). In the non-uniform iid case. Its, Tracy, 
and Widom [201 121] described the corresponding limiting distribution as that 
of the largest eigenvalue of one of the diagonal blocks (corresponding to the 
highest probability) in a direct sum of certain independent GUE matrices. 
The number and respective dimensions of these matrices are determined by 
the multiplicities of the probabilities of choosing the letters, and the direct 
sum is subject again to an overall zero-trace type of condition. 

The well-known Robinson-Schensted-Knuth (RSK) correspondence be- 
tween sequences and pairs of Young tableaux led Tracy and Widom [2S] to 
conjecture that the (necessarily m-row) Young tableau of a random word gen- 
erated by an m-letter, uniform iid sequence has a limiting shape given by the 
joint distribution of the eigenvalues of a m x m traceless element of the GUE. 
Since the length of the longest row of the Young tableau is precisely Lin, this 
appears to be a natural generalization. Johansson [22] proved this conjecture 
using orthogonal polynomial methods. Further, Okounkov [27j , and Borodin, 
Okounkov, and Olshankii [6j, as well as Johansson [22], also answered a con- 
jecture of Baik, Deift, and Johansson [1, 2j regarding the limiting shape of 
the Young tableau associated to a random permutation of {1, 2, . . . , n}. In 
particular, as n grows without bound, the lengths Ai, A2, . . . , of the first k 
rows of the Young tableau, appropriately centered and scaled, have the same 
limiting law as the k largest eigenvalues of a n x n element of the GUE, a 
result first proved, for = 2, in [H [2]. 

The extension to the non-uniform iid case was addressed to some degree 
in Its, Tracy, and Widom [20l [21], who focused primarily on the top row of 
the Young tableau. Here the obvious conjecture is that the limiting shape 
has rows whose suitably centered and normalized lengths have a joint dis- 
tribution which is that of the whole spectrum of the direct sum of GUE 
matrices described above. Below, we prove this result as a special case of the 
Markovian framework. 

Kuperberg [23] conjectured that if the word is generated by an irreducible, 
doubly-stochastic, cyclic Markov chain, then the limiting distribution of the 



2 



shape is still that of the joint distribution of the eigenvalues of a traceless 
m X m element of the GUE. For m = 2, this was shown to be true by 
Chistyakov and Gotze [9], who, in view of further simulations, expressed 
doubts concerning the validity for m > 4. For m = 3, we will show that 
the conjecture holds as well. However, for m > 4, this is no longer the case. 
Indeed, some, but not all, cyclic Markov chains lead to a limiting law as in 
the iid uniform case already obtained by Johansson [22] ■ 

The precise class of homogeneous Markov chains with which Kuperberg's 
conjecture is concerned is more specific than the ones we shall study. The 
irreducibility of the chain is a basic property we certainly must demand: each 
letter has to occur at some point following the occurrence of any given letter. 
Moreover, the doubly-stochastic hypothesis ensures that we have a uniform 
stationary distribution. However, the cyclic criterion, i.e., the Markov tran- 
sition matrix P has entries satisfying pij = pj+ij+i, for 1 < i, j < m (where 
m-|-l = 1), is more restrictive: cyclicity implies but is not equivalent to P be- 
ing doubly stochastic. Kuperberg was led to introduce this latter restriction 
via simulations [21] inspired by mathematical physics considerations, which 
appear to show that at least some irreducible, doubly-stochastic, non-cyclic 
Markov chains do not produce such limiting behavior. 

Let us also note that Kuperberg implicitly assumes the Markov chain to 
also be aperiodic. Indeed, the simple 2-state Markov chain for the letters 
«! and a2 described by P(X„+i = ai|X„ = aj) = 1 for i 7^ j, produces a 
sequence of alternating letters, so that LJ„ is always either n/2 or n/2 + 1, for 
n even, and {n+l)/2, for n odd, and so has a degenerate limiting distribution. 
Even though this Markov chain is irreducible, doubly-stochastic, and cyclic, 
it is periodic. 

The paper is organized in the following manner. In Section 2, we present 
the simple combinatorial formulation of the L/„ problem the authors devel- 
oped in [18]. Next, in Section 3, we use this formulation to rederive the 
two- letter Markov case first studied by Chistyakov and Gotze [9]. Then, in 
order to extend these results to alphabets of size m > 3, we introduce, in Sec- 
tion 4, a slight modification of our original combinatorial development, and 
so obtain a functional of combinatorial quantities which describes the shape 
of the entire Young tableau with n cells, along with a concise expression 
for the associated asymptotic covariance structure. In Section 5, we apply 
Markovian Invariance Principles to express the limiting shape of the Young 
tableau as a Brownian functional for all irreducible, aperiodic, homogeneous 
Markov chains (without the cyclic or even the doubly-stochastic constraint.) 
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Using this functional we are then able to answer Kuperberg's conjecture. In 
Section 6, we investigate, in further detail, various symmetries exhibited by 
the Brownian functional. In particular, we clarify the asymptotic covariance 
structure in the cyclic case, and obtain, for m arbitrary, a precise descrip- 
tion of the class of cyclic Markov chains having the same limiting law as in 
the uniform iid case. In Section 7, we further explore connections between 
the various Brownian functionals obtained as limiting laws and eigenvalues 
of random matrices. Finally, in Section 8, we conclude with a brief discus- 
sion of natural extensions and complements to some of the ideas and results 
presented in the paper. 

2 Combinatorics 

As in [18], one can express LJ„ in a fundamentally combinatorial manner. 
For convenience, this section recapitulates that development. 

Let (X„)„>i consist of a sequence of values taken from an m-letter ordered 
alphabet. Am = {«! < 0:2 < ■ ■ ■ < «m}- Let be the number of occur- 
rences of ttr among (Xj)i<j<fc. Each increasing subsequence of {Xi)i<i<k 
consists simply of consecutive identical values, with these values forming 
an increasing subsequence of a,.. Moreover, the number of occurrences of 
ttr G {ai, . . . ,am} among (Xj)fc+i<j<£, where 1 < k < i < n, is simply 
— a''f^. The length of the longest increasing subsequence of Xi, X2, . . . , X„ 
is thus given by 

LIn= max [(4^-4) + (a^^-a^J + ... + (a™-a^_J], (2.1) 

0<Kl<--- 

<km-l<n 

i.e., 

= max [{al - alj + {a^ - alj + ■ ■ ■ + (a--\ - a^J + (2-2) 

0<Kl<--- 

<km-i<n 

where Oq = 0. 

For i = 1, . . . ,n and r = 1 , . . . , m — 1 , let 
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{1, if Xi = Or, 

-1, if X, = ar+u (2.3) 
0, otherwise, 

and let 5"^ = Yl'i=i ^ — ^, ■ ■ ■ ,n, with also Sq = 0. Then clearly = 
al — flfc"^^. Hence, 

L/„= max {Sl + Sl + ^-^ + SZ^.+a^:}. (2.4) 

0<fci<-- 

<km-i<n 

By the telescoping nature of the sum YlT=r = Yll^=r^'^n ~ '^n^^)) 
find that, for each 1 < r < m — 1, aj^ = + Yl!k=r ^n- Since a^, . . . , a™ 
must evidently sum to /c, we have 



n 



r=l 

m-1 / 

r=l \ k=r 
m—1 

+ ma:;:. 



m—l 



r=l 



Solving for a™ gives us 



m—l 




r=l 



Substituting into (12 .4^ . we finally obtain 

LI^ = ---yrS:+ max {S^ + Sl + ■ ■ ■ + S^-\}. (2.5) 

m o<fci<- ' ' "'"-^ 

As was emphasized in |18] (12. 5p is of a purely combinatorial nature or, in 
more probabilistic terms, is of a pathwise nature. We now proceed to analyze 
(12.51) in the case of a Markovian sequence. 
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3 Markovian Alphabet: 2-Letter Case 

We begin our study of Markovian alphabets by concentrating on the 2-letter 
case. Here {Xn)n>o is described by the following transition probabilities 
between the two states (which we identify with the two letters ai and 02): 
F{Xn+i = a2\Xn = ai) = a and P(X„+i = ailX^ = 0:2) = b, where < 
a + b < 2. We later examine the degenerate cases a = b = and a = b = 1. 
In keeping with the common usage within the Markov chain literature, we 
begin our sequence at n = 0, although our focus will be on n > 1. Denoting 
by {Pn,Pn) the vector describing the probability distribution on {ai,a2} at 
time n, we have 



The eigenvalues of the matrix in fl3.ll) are Ai = 1 and — 1 < A2 = 1 — 
a — b < 1, with respective left eigenvectors (vri, 112) = {b/ (a + 6), a/ (a + h)) 
and (1, —1). Moreover, (tti, 712) is also the stationary distribution. Given any 
initial distribution (Po;Po); ^"^^ that 



as n — > 00, since A2 < 1. 

Our goal is now to use these probabilistic expressions to describe the 
random variables and S\ defined in the previous section. (We retain the 
redundant superscript "1" in Z]. and S]. in the interest of uniformity.) 

Setting (3 = apl — bp^, we easily find that 




(3.1) 



(Pn,Pn) = (7ri,vr2) + A^' 



,, apl-bpl 
^ a + b 



(1,-1) ^ (7ri,7r2) 



(3.2) 




a + b 



a + b 



(3.3) 



for each 1 < k < n. Thus, 




(3.4) 
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and so KSl/k — > (6 — a)/(a + b), as ^ oo. 

Turning to the second moments of Zl and Si, first note that E(Z^)^ = 
1, since {ZlY = 1 a.s. Next, we consider ¥.Z\Z}, for k < i. Using the 
Markovian structure of {Xn)n>o, it quickly follows that 



^ ^ (vTi - Xt''^^) {712 - A^^) , if (xfc, xe) = (ai, a^), 
(tts - A^-'^^) (vTi + A^^) , if (xfc, x^) = (aa, "i), 
. (^2 + A^'^^) (vrs - A^^) , if (xfc, Xe) = (aa, aa). 



For simplicity, we will henceforth assume that our initial distribution is 
the stationary one, i.e., {pi, pi) = (7ri,7r2). Later, (see Concluding Remarks) 
we drop this assumption and deal with initial distributions concentrated 
on an arbitrary state. Under this assumption, j3 = 0, KSl = kfj,, where 
fi = KZl = {b — a)/{a + b), and (13.51) simplifies to 



P((Xfc,X,) = {xk,x,)) 



if {xk,X() 
if {xk,X() 
if {xk,xi) 
if {xk,X() 



(ai,ai), 
("1,02), 
(a2,ai), 
(a2,a2)- 



^ , 1^1 - A2 712, 

. (vr2 + A^''^) TTa, 



(3.6) 



We can now compute KZlZj: 



EZlZj = ¥{ZlZj = +1) - nzlZj = -1) 
= F{{Xk,Xe) G {(ai,ai),(a2,«2)}) 

-P((Xfc,X,) e {(«i,a2),(«2,«i)}) 




7 



{b - aY Aah 
(a + 6)2 + (a + 6)2 ^2 



Hence, recalling that /3 = 0, 



:= YwZl = 1 



— a 



a + b 



4ab 



(a + 6)2' 

for all A; > 1, and, for k < i, the covariance of and Zj is 



Proceeding to the covariance structure of Si, we first find that 



k 

Vs^rSl = VarZj + 2 Cov(Z], Z/) 

j=i j<e 

■A^+^-A;A2 + (A:-l)A5 



a^A; + 2a^ 



(1-A2)2 



(3.8) 



^-(^^^ ^1) - 1^ + ^^Ar^ - (^) ^ .^Ar^. (3.9) 



Next, for k < i, and using (13. 9p and (13.101) . the covariance of Si and Sj 
is given by 



k e 

i=l 0=1 

k k I 

= 5^VarZ,^ + 2 J] Cov(Z^ Zj) + ^ ^ Cov(Z^ Z]) 

i=l i<j<k 1=1 j=k+l 
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k e 

1=1 j=k+l 

Var6, + a ^ _ ^^^^ 

1 + A2\ , A2(l-A^)(l + A^'^^ 



I-X2J (1-A: 
From (13.101) and (13. lip we see that, as k ^ 00 



fe- " .rx r (3-11) 



A; Vl-A 
and, moreover, as A ^ 00, 



Cov(Sl.S.) .,.fi±A., (3,13) 



(kAi) \1-X 
When a = b, KSl = 0, and in (13.12^ the asymptotic variance becomes 



Var^^ ^ 4a^ / l + (l-2a) \ 
' (2a)2 Vl - (1 -2a) J 



For a small, we have a "lazy" Markov chain, that is, a Markov chain which 
tends to remain in a given state for long periods of time. In this regime, the 
random variable Si has long periods of increase followed by long periods of 
decrease. In this way, linear asymptotics of the variance with large constants 
occur. If, on the other hand, a is close to 1, the Markov chain rapidly shifts 
back and forth between cti and a2, and so the constant associated with the 
linearly increasing variance of 5*^ is small. 

As in [18], Brownian functionals play a central role in describing the lim- 
iting distribution of LJ„. By a Brownian motion on [0, 1] we shall mean an 
a.s. continuous, centered Gaussian process having stationary, independent 
increments, and which is zero at the origin. By a standard Brownian motion 
B{t), < t < 1, we shall further require that VaiBit) = t, < t < 1, i.e., we 
endow C[0, 1] with the Wiener measure. A standard m- dimensional Brown- 
ian motion will be defined to be a multivariate process consisting of m in- 
dependent Brownian motions. More generally, an m- dimensional Brownian 
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motion shall refer to a linear transformation of a standard m-dimensional 
Brownian motion. Throughout the paper, we assume that our underlying 
probability space is rich enough so that all the Brownian motions and se- 
quences we study can be defined on it. 

To move towards a Brownian functional expression for the limiting law 
of Lin, define the polygonal function 



for < t < 1. In our finite-state, irreducible, aperiodic, stationary Markov 
chain setting, we may conclude that i?„ =^ B, as desired. (See, for example, 
Gordin's martingale approach to dependent invariance principles [15], and the 
stationary ergodic invariance principle found in Theorem 19.1 of Billingsley 

Turning now to L/„, we see that for the present 2-letter situation, (12.51) 
simply becomes 

Lin = "S"^ + max Sl. 

2 2 " l<k<n ^ 

To find the limiting distribution of L/„ from this expression, recall that 
TTi = 6/(a + 6), = a/(a + &), = 111 — 1x2 = {b — a)/{a + b), o"^ = 
Aab/{a + 6)^, and that A2 = 1 — a — 6. Define iXmax = max{7ri,7r2} and 

= a\l + A2)/(l - A2). Rewriting (imj) as 

J^n[t) = —= h 



a\/n a\/n 

Lin becomes 

Lin ^ 2 ~ \ (^^VnBn{l) + fJ'nj + max {dy/nBn{t) + If-nt 
= n'7i2 - (dr^/nBn{l)] + max (ay/nBnit) + (tti - 'K2)nt 



mi. 



max 



+ max {a^/nI3n{t) + (tti - Ti2)nt - {-Kmax - 7<'2)nj . (3.15) 
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This immediately gives 



= -o^nlJ-y 

a\/n I 



'n \ 
+ max ( Bn{t) + ^((vTi - 712)1 - (vr^ax - VTa)) . (3.16) 

0<t<l \ (7 J 

Let us examine (13.161) on a case-by-case basis. First, if Umax = ni = 7^2 = 
1/2, i.e., if a = 6, then a = 1 and a = (1 — a)/a, and so (13.161) becomes 



""^^^ = -\Bnil) + max 5„(t). (3.17) 
^{l-a)n/a 2 o<t<i 

Then, by the Invariance Principle and the Continuous Mapping Theorem, 

~ ""^^ ^ --5(1) + max B{t). (3.18) 
v/(l-a)n/a 2 o<t<i ^ ' ^ ' 

Next, if TTmax = T^2 > TTi, (I3.16p becomes 



---D„ i 



(JVn 2 



+ max ( Bn{t) - ^{TTmax - Tri)t ] . (3.19) 

o<t<i \ a 



On the other hand, if Timax = tti > 712, ( I3.16P becomes 

= -o^nll) 

/ " \fn 
+ max I Bn{t) —{j^max - 7r2)(l - t) 

= \BniX) 

max ( 5„(t) - B„(l) - ^{Tlmax " 7r2)(l - i) ) • 
0<i<l \ CT 



(3.20) 
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In both fl3.19l) and fl3.20l) we have a term in our maximal functional which 
is linear in t or 1 — t, with a negative slope. We now show, in an elementary 
fashion, that in both cases, as n oo, the maximal functional goes to zero 
in probability. 

Consider first fl3.19p . Let c„ = y/n^Hmax — t^\)I^ > 0, and for any c > 0, 
let Mc = maxo<f<i(-B(t) — ct), where {B{t)) is a standard Brownian motion. 
Now for n large enough, 

Bnit) -Ct> ]3n{t) - Cnt 

a.s., for all < t < 1. Then for any z > 0, and n large enough. 



P(max(5„(t) - c„t) > z) < P(max(5„(t) - ct) > z), (3.21) 
and so by the Invariance Principle and the Continuous Mapping Theorem, 



limsupP(max(i?„(t) - Cnt) > z) < lim P(max(i?„(t) - ct) > z) 

n^oo 0<t<l n^oo 0<t<l 

= P(Mc > z). (3.22) 

Now, as is well-known, P(Mc > -z) ^ as c — > oo. One can confirm this 
intuitive fact with the following simple argument. For 2; > 0, c > 0, and 
< e < 1, we have that 



P(Mc > z) < P(max(5(t) - ct) > z) + P(max(5(t) - ct) > z) 

0<t<e e<t<l 

< P(max B{t) > z)+ P(max(E(t) - ce) > z) 

0<t<e e<t<l 

< P(max B(t) > z)+ P(max B(t) > ce + z) 

0<t<e 0<t<l 

= 2(^l-$(^-^^^ +2(l-<l>(c£ + z)). (3.23) 

But, as c and e are arbitrary, we can first take the limsup of fl3.23p as c — > 00, 
and then let e — 0, proving the claim. 
We have thus shown that 

limsupP(max(5„(i(;) - c„t) > z) <0, 

n^oc 0<t<V 
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and since the functional clearly is equal to zero when t = 0, we have 



max(BJt) - Cnt) 0, (3.24) 

0<t<l 

as n ^ oo. Thus, by the Continuous Mapping Theorem, and the Converging 
Together Lemma, we obtain the weak convergence result 

Lin - T^rnaxU (3 35) 



Lastly, consider fl3.20p . Here we need simply note the following equal- 
ity in law, which follows from the stationary and Markovian nature of the 
underlying sequence (X„)„>o: 



a 



= -4(1 - t) - ^{^ma. - vr2))(l - t), (3.26) 
a 



for t = 0, l/ra, . . . , (n — l)/n, 1. With a change of variables (m = 1 — t), and 
noting that B{t) and —B{t) are equal in law, our previous convergence result 
<^2^ implies that 



max(S„(t) - Bn{\) - Cn(l - t)) = max {-Bn{u) - c„m) 0, (3.27) 

0<t<l 0<M<1 

as ^ oo. Our limiting functional is thus of the form 

Lin - T^rnaxU ^ 1^^^^^ ^g) 

aJn 2 



Since -B(l) is simply a standard normal random variable, the different signs 
in f l3.25p and (13.281) are inconsequential. 

Finally, consider the degenerate cases. If either a = or 6 = 0, then the 
sequence (X„)„>o will be a.s. constant, regardless of the starting state, and 
so Lin ~ On the other hand, if a = 6 = 1, then the sequence oscillates 
back and forth between ai and 0^2, so that L/„ ^ njl. Combining these 
trivial cases with the previous development, we have proved the following 
theorem: 
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Theorem 3.1 Let (X„)„>o be a 2-state Markov chain, with P(X„+i = 
Xn = ai) = a and P(X„_|_i = ai|X„ = 02) = b. Let the law of Xq be the 
invariant distribution (tti, H2) = (6/ (a + 6), a/ {a + b)), for < a + 6 < 2, and 
(tti, 1^2) = (1, 0), for a = b = 0. Then, for a = b > 0, 



^^V^i"2 



fi(l) + max^mV (3.29) 

o<t<i J 



where (-B(t))fg[o,i] is a standard Brownian motion, and for a 7^ b or a = 6 = 0, 
and -Umax = max{7ri, 712}, 



Lin '^max^ -\t t r\ ~2 



n 



X(0,aV4), (3.30) 



where X(0, 5"^/4) is a centered normal random variable with variance cr^/4 = 
ab{2 — a — b)/{a + 6)^, for a ^ b, and a'^ = 0, for a = b = 0. (If a = b = 
1, or = 0, then the distributions in fl3.29p and (13.301) . respectively, are 
understood to be degenerate at the origin.) 

To extend this resuh to the entire Young tableau, let us introduce the 
following notation. By 

iY^'\Y(^\ . . . , yi^)) ^ {YW,Yi^\ . . . , y W) (3.31) 
we shall mean the weak convergence of the jomt law of the fc- vector {Yn^\ y/^^ 



n , 



. . . , Yn''^) to that of {Y,^\ Y^\ . . . , Y^^), as n — > cxd. Since L/„ is the length 
of the top row of the associated Young tableau, the length of the second row 
is simply n — Lin- Denoting the length of the i^^ row by LY^, ( 13.3ip . to- 
gether with an application of the Cramer- Wold Theorem, recovers the result 
of Chistyakov and Gotze [9] as part of the following easy corollary, which is 
in fact equivalent to Theorem 13.11 



Corollary 3.1 For the sequence in Theorem \3.1[ if a = b > 0, then 

(^^^,^Xl^-^^y^...^yS>,y.>^, (3,32) 

where the law ofY^o is supported on the 2"°' main diagonal ofM."^, and with 
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If a ^ b or a = b = 0, then setting T^min = niin{7ri, 1x2}, we have 



1 . ^ ^ TV2 _ „ . .,, , 

iV((0,0),S), (3.33) 



'n \/n 
where S is the covariance matrix 



...... , 1 -1 



where = Aab{2 — a — b)/{a + b)^ , for a ^b, and a"^ = 0, for a = b = 0. 

Remark 3.1 The joint distributions in (13.321) and (13.331) are of course de- 
generate, in that the sum of the two components is a.s. identically zero in 
each case. In (I3.32p . the density of the first component ofY^o is easy to find, 
and is given by (e.g., see fT^) 
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3/2 



As in Chistyakov and Gotze 19], (13.321) can then be stated as: For any 
bounded, continuous function g -.M? ^M., 



LY^: - n/2 LY^ - n/2 



lim 9 1 > 

V \v(l-a,)n/a ^J{l-a)n/c 

POO 

2V27T / g{x, -x)(j)GUE,2{x, -x)dx, 



where (pGUE,2 is the density of the eigenvalues of the 2x2 GUE, and is given 
by 

<f)GUE,2{^u^2) = -(xi -a;2)^e-(^'?+^2\ 
vr 

To see the GUE connection more explicitly, consider the 2x2 traceless 
GUE matrix 
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Mn 



Xi Y + iZ 
Y -iZ X2 



where Xi, X2, Y , and Z are centered, normal random variables. Since Corr (Xi, X2) 
— 1, the largest eigenvalue of Mq is 



Ai,o = ^Jxf + Y^ + Z-^, 



almost surely, so that q ~ xi ^/ ^'^'^ ^1 = ^o,"!^ ^ = ^o,"!^ Z = 1. Hence, up 
to a scaling factor, the density of Xi^ is given by (13.341) . Next, let us perturb 
Mo to 



M = aGI + /3Mo, 

where a and j3 are constants, G is a standard normal random variable inde- 
pendent of Mq, and I is the identity matrix. The covariance of the diagonal 
elements of M is then computed to be p := o? — ^5"^ . Hence, to obtain a given 
value of p, we may take a = a/(1 + p)/2 and (3 = ^/(l — p)/2. Clearly, the 
largest eigenvalue of M can then be expressed as 



Ai = ^^^G'+^i^Ai,o. (3.35) 

At one extreme, p = —1, we recover Ai = Ai^o- At the other extreme, p = 1, 
we obtain Ai = Z . Midway between these two extremes, at p = 0, we have a 
standard QUE matrix, so that 

Ai = y|(G + Ai,o). 



4 Combinatorics Revisited 

The original combinatorial development for the m-letter alphabet resulted in 
m — 1 quantities S'^,l<r<m — 1. In the 2- letter case we were then able to 
proceed with a probabilistic development which involved a single Brownian 
motion. Using an even more straightforward development which involves m 
quantities instead, we can obtain more symmetric expressions for LJ„. This 
is done next, and will prove useful when studying the shape of the whole 
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Young tableau. 



Recall that counts the number of occurrences of among (Xj)i<i<fc. 
Moving beyond the purely combinatorial setting, assume that {Xk)k>o is a 
doubly-infinite sequence generated by an irreducible homogeneous Markov 
chain having a stationary distribution (tti, 7T2, . . . , iTm)- (For no A; > is the 
law of Xfc necessarily assumed to be the stationary distribution.) For each 
1 < r < m, set = — T^rk, for k > 1, and Tq = 0. Beginning again with 
(lO) . we find that 



LL 



n — max 

0<fci<--- 

<km~i<n 



max 

0<fci<--- 

<km-l<n 



« - «o) + « -4J + ■■■ + «- 

((r,\ + TTifci) - (T^ + vrifco)) + iK + vr^A;^) - (T^ + n,k^)) 



+ ■■■ + {{TZ + nrnkm) - {TZ_, + T^mkm-l)) 



max 

0<fci<-- 

<km-i<n 



K - Tl) + {Tl - Tl) + . . . + (T™ - T-_J 



+ 'Ki{ki - ko) + ■K2{k2 - fci) H h vrm(A;m - A;m_i) 

Setting Umax = maxjTTi, 7r2, . . . , VTm}, (14.1 p becomes 



(4.1) 



LJ„ - TTmaxn = HiaX [(T^' - T: ) + (tT,. - 7rrnax){kr " A;^-! 



0=ko<ki< 



(4.2) 

For a uniform alphabet, iTmax = iTr = 1/m, for all r, and (14. 2 p simplifies to 

m 

L/„--= max 5^(T,;-T,;_J. (4.3) 

m 0=ko<ki<- ^ 

To introduce a random walk formalism into the picture, we next set, for 
i = 1, . . . ,n and r = 1, 2, . . . , m, 

lO, otherwise. 
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Clearly, al = ^ti and so Tl = EIi(W - vr,), for 1 < r < m. 



To understand the limiting law of (14.21) or (14. 3p . we must have a more 
precise description of the underlying Markovian structure. To that end, let 
Pr,s = IP(-^fc+i = (y.s\Xk = ar) be the transition probability from state to 
state as, and let P = {pr,s) be the associated Markov transition matrix. In 
this setting. 



Moreover, as usual, let prj denote the fc-step transition probability from ar 
to as', its associated transition matrix is simply P^. 

Assume now that the law of Xq is the stationary distribution. Thus, by 
construction, ET^ = for all 1 < r < m and 1 < k < n, and our primary 
task is to describe the covariance structure of these random variables T^. 



Since W[ is, simply, a Bernoulli random variable with parameter vr^, 
VarPV/" = 7ir{l - TTr). Wc then find that, ior k > 1, 




• • • ) Prn' ) ~ (Pl ) ^2 ? ■ ■ • ) Pm)-P- 




\ i=l / 

k fe-1 k 




k ii — 1 




(4.5) 



ii=2 12=1 



By stationarity, (14. 5 p becomes 



k k-1 k 




ii=2 12=1 

fe-1 k 



ii=l i2=«l+l 
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k ii — 1 

jl=2 42 = 1 

k 

ii=l i2=n+l 
k ii — l 

+ 7r,5^^e,P---e^, (4.6) 

41=2 42 = 1 

where = (0, 0, . . . , 0, 1, 0, . . . 0) is the r*'' standard basis vector of M*". 
Setting 

fc-l A; k 

Q^ = llll P'^-'^ = Y.{k-^)P\ (4.7) 

ii=l 12=11+1 i=l 

we can rewrite (14. 6 p in the simple form 

VarT^ = knr - Ptt"^ + 27r,e,Qfce^. (4.8) 

Our description of the covariance structure can now be completed using 
the above results. For ri ^ r2 and k > 1, 

k k-1 k 

1=1 11=1 12=41 + 1 

+Ei;cov(WA-,w,-) 

ii=2 42=1 
k k-1 k 

= Y.CoviW[\Wn + Y, E Coy{W^\W[^Uj 

4 = 1 41 = 1 42=41 + 1 

k 41 — 1 

+ EECov(W^o",W^?-42) 

41=2 22 = 1 

fc-l fc 

= -klTr^TTr^ + E E (^^lPri,r2'^ ~ ^r^'^r^) 

41=1 42=41+1 
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k — 1 
il=2 12=1 

k 

ii=l i2=ii+l 
k — 1 

41=2 42 = 1 

= -fc^TT^.TTra + T^r^er^Qker^ + T^ri^riQ ke^^- (4.9) 



Remark 4.1 Both (14.81) and (14.91) appear to be asymptotically quadratic in 
k. However, since Qk = ^\=i{k — i)P^ , cancellations will show that when the 
Markov chain is irreducible and aperiodic, the order is, in fact, linear in k. 

In order to further analyze the asymptotics of Qk-, we first examine the 
diagonalization of P for a very general class of transition matrices. 

Proposition 4.1 Let P be the m x m transition matrix of an irreducible, 
aperiodic, homogeneous Markov chain with eigenvalues 1 > IA2I > • • ■ > |Am|, 
and let A = diag{l, A2, . . . , A^). Let P = S^^AS be the diagonalization of 
P, where the rows of S consist of the left-eigenvectors of P, with, moreover, 
the first row of S being the stationary distribution (tti, 712, ... , Hm)- Then the 
first column of S^^ is (1,1,..., 1)-^. 

Proof. Since P = S^^AS, then PS^^ = S'^^A. Denoting the first column 
of S^^ by Ci, we have Pci = C\. But since the rows of P sum to 1, we see 
that ci = (1, 1, . . . , 1)^ satisfies Pc\ = ci. Moreover, ci must be unique, 
up to normalization, since the irreducibility of P implies that Ai = 1 has 
multiplicity 1. Finally, since the inner product of the first row of S and the 
first column of 5*"^ is 1, the correct normalization is indeed (1, 1, ... , 1)^. ■ 

Returning to Qk, as given in (14.71) . and using Proposition 14.11 we then 
obtain: 

Theorem 4.1 Let (X„)„>o be a sequence generated by an m-letter, aperiodic, 
irreducible, homogeneous Markov chain with state space Am = {0^1 < ■ ■ ■ < 
am}, transition matrix P, and stationary distribution (vri, 7r2, . . . , VTm)- Let 
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also the law of Xq he the stationary distribution. Moreover, for 1 < r < m, let 
Tl = a]^ — TTrk, for k > 1, and Tq = 0, where is the number of occurrences 
of Or among (Xj)i<j<fc. Then, for 1 < r < m. 



Var T'' 

lim ——J^ = TT, (1 + 2erS-^DSe^) , (4.10) 
fc— >oo k 



and for ri ^ r2. 



lim ^^^^^fc^'^fc') = 7Tr,er,S-^DSel + 7rr,er,S~'DSeJ^ , (4.11) 
fc^oo k 



where P = S ^AS is the standard diagonalization of P in Proposition □77 



and D = diag{— 1/2, A2/ (1 — A2), . . . , Xm/ (1 — Am))- That is, the asymptotic 
covariance matrix of (T^, T|, . . . , TJJ^) is given by 

E = U + U{S'^DS) + {S'^DSfU, (4.12) 
where U = diagij^i, 112, ■ ■ ■ , Tim)- 

Proof. Beginning with (14.71) . we diagonalize P and find that 



fc-i 



Qk = J2ik-i){S~'ASy 



i=l 

'fc-1 



^-Miag(/i(l), hi\2), hi\^))S, (4.13) 



where h{X) := ^2=ii''^ ~ k)\^. Now h{l) = k{k — l)/2 is quadratic in k, 
while for A 7^ 1, 

, ^ A(A^- - 1) 



(1-A) (i-xy ' 

so that h{X) is hnear in k. We thus can write Qk as the sum of terms which 
are, respectively, quadratic and linear in k. Recalling, moreover, that the 
first row of 5* contains the stationary distribution, and that the first column 
of is (1, 1, ... , 1)-^, we have 
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Qk = ^-Miag(/i(l), h{\2), hiXm))S, 
= ^5-Miag(l,0,...,0)^ 



1 A, 



1- A. 



S + o{k) 



( 



2 



TTl • • • TTm 

TTl vr2 • • • 7r„ 



(4.14) 



\7ri 7r2 • ■ ■ 

Starting with the variance in (14.81) . we now find that, for each 1 < r < m, 

Var = fcvTr — /c^TT,^ + l-KrCrQu^r 

= kiTr — k'^'nl + 2Ttr { ^ kcrS'^DSeJ j + o{k) 

= kTTr (1 + 2erS-^DSe^) + o{k), (4.15) 



from which the asymptotic result (14.101) follows immediately. 

An identical development shows that, for ri 7^ r2, (14. 9 p simplifies to 



Cov{Tl\T, 



r2^ 
k I 



k VTriVTrj + TTr-i [ -^'^r2 + kCr^S DSe 



1 no„T 

r2 



from which the asymptotic result (14.111) follows, and so does (I4.12p . 



(4.16) 



Remark 4.2 To see that ( 14.10p and ( 14. lip &ot/i recover the covariance re- 
sults for the ad case investigated by the authors in ^IS], let P he the transition 
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matrix whose rows each consist of the stationary distribution (tti, 7r2, . . . , 7r„ 
In this case A2 = ■ ■ ■ = = 0, and so D = diag{— 1/2, 0, . . . , 0). Hence, 



er,S''DSel = [1,*, ...,*) D (vr,,, *f 



for all Ti and r2, and so, for each r. 



k^oo k V V 2 

while, for r^. 



k " V 2 J " \ 2 

Note that, in the uniform iid case, we have tTj. = 1/m, for all 1 < r < m. 
Hence, for ri ^ r2, the asymptotic correlation between TJ} and T^^ is given 
by ( — l/(m^))/((l/m)(l — 1/m)) = — l/(m — 1), so that the covariance matrix 
is indeed the permutation- symmetric one obtained in the iid uniform case in 
[TBI . There is, moreover, another Brownian functional representation for the 
iid uniform case in JT^ in which the Brownian motions have a tridiagonal 
covariance matrix. 



5 The Limiting Shape of the Young Tableau 

Thus far, our results have centered on Lin alone, essentially ignoring the 
larger question of the structure of the entire Young tableau. The present 
section extends the combinatorial development of the previous section to 
answer the question of the limiting shape of the Young tableau. 

Our first result in this direction is a purely combinatorial expression 
generalizing (12.11) . It is standard in the Young tableau literature to have 
entries chosen from the set {1,2, ...,m}. Below, without loss of general- 
ity, we allow our entries to be chosen from the m-letter ordered alphabet 

Am = {ai< ■ ■ ■ < Otm}- 

Theorem 5.1 Let R^, . . . be the lengths of the first 1 < r < m rows 
of the Young tableau generated by the sequence (X/c)i<fc<„ whose elements 
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belong to an ordered alphabet Am = {«i < ■■■ < Om}- Then, for each 
1 < r < m, the sum of the lengths of the first r rows of the Young tableau is 
given by 

r r m—r+j 

0=\ 3=\ £=j 

where Jr,m = {{kj^£, 1 < j < r,0 < i < m) : kjj^i = 0, kj^m-r+j = n,l < j < 
'f') kj/-i < kj^i, I < j < r,l < i < m; kj^i < 2 < j < r,l < i < m}, and 

where al is the number of occurrences of ai among {Xi, X2, . . . , X^}. 

Proof. Recall that the sum of the lengths of the first r rows of the Young 
tableau generated by a sequence {Xk)i<k<n, whose letters arise from an m- 
letter alphabet, has an interpretation in terms of the length of certain increas- 
ing sequences. Indeed, the sum i?^ + + ■ ■ ■ + i?^ is equal to the maximum 
sum of the lengths of r disjoint, increasing subsequences of (Xfc)i<fc<„, where 
by disjoint it is meant that each element of {Xk)i<k<n occurs in at most one 
of the r subsequences. (See Lemma 1 of Section 3.2 in [13]). More general 
results of this sort, involving partial orderings of the alphabet and associated 
antichains, are known as Greene's Theorem [l7j. However, such results are 
not enough for our purpose. Below we need a different way of reconstructing 
disjoint subsequences. 

We begin by examining an arbitrary collection of r disjoint, increasing 
subsequences of {Xk)i<k<n, and show that we can always map these r subse- 
quences onto another collection of r disjoint, increasing subsequences whose 
properties will be amenable to our combinatorial analysis. 

Specifically, with the number of rows r fixed, suppose that, for each 1 < 
j < r, we have an increasing subsequence (-^^j )i<£<n of length uj < n, and 

k^ J 

that the r subsequences are disjoint. 

We first construct the new subsequence {Xh)i<£<n^ as follows. First, 

place all ais occurring among the r original subsequences into (Xiji<£<j^^, if 

k^ 

there are any. If the last ai occurs at the n*'^ index, then {Xh)i<i<fi-^ is com- 

k^ 

plete. Otherwise, place all which occur after the final ai into {Xh)i<£<nj^, 

k^ 

if there are any. If the last 0:2 occurs at the n*^ index, then (X^i)i<^<fj^ is 

k^ — — 

complete. Otherwise, continue adding, successively, as, . . . , am-r+i in the 
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same manner. Thus, {Xh)i<£<hi consists of a weakly increasing sequence of 
length Hi having values in {ai, . . . , am-r+i}- 

Next, we construct the new subsequence (X?2)i<£<n2 similarly. By con- 

£ 

sidering only those letters among the r original subsequences which have 
not already been moved to the first new subsequence, start with the small- 
est available letter, 0^2, and continue adding, successively, as, . . . , am+r-2- 
Note that, crucially, all a2S added to (X?2)i<£<n2 occur before the last in- 
dex at which ai was added to the first subsequence. More generally, each 
ttj, 2 < j < m — r + 2, added to (X|2)i<^<ri2 occurs before the last 

was added to the first subsequence. Thus, (-^r2)i<£<n2 consists of a weakly 
increasing subsequence of length h2 having values in {a2, ■ ■ ■ , am-r+2}- 

The construction of (-^^j)i<£<nj, for 3 < j < r, continues in the same 

manner, with {XL)i<e<n , constructed from among the entries of the r orig- 
inal subsequences which were not moved into any of the first j — 1 new 
subsequences, so that (-^rj)i<^<fij, consists of a weakly increasing sequence 

£ 

of length fij having values in {aj, . . . ,am-r+j}- It is possible that beyond 
some j > 2 the new subsequences may be empty. 

We claim that, indeed, the construction of the r^^ new subsequence ex- 
hausts the set of available entries. Indeed, without loss of generality, assume 
that after we have created the (r — 1)*'^ new subsequence, the set of avail- 
able entries is non-empty, and designate the location of the final to be 
included in the j^^ new subsequence by kj^£, for 1 < j < r and 1 < £ < m. 
(If no ai was available for inclusion, set kj^£ = where kj^ = 0, for 

all 1 < j < Clearly, all ai, a2, • • • , «r-i have been included in the first 
r — 1 new subsequences. If r = m, we are done: simply put the remaining 
a-rS into the r*'* new subsequence. If r < m, we may still ask whether there 
was, for some r + 1 < i < m, an ai from among the available entries which 
occurred before fc^/-!. Assume that there is such an a^. Now by construc- 
tion, kj-^-i^e^r+j < ^j.^-r+j-i, for 1 < j < r — 1. Hence, there exist letters 
aj-^ < aj^ < ■ ■ ■ < aj,. < ai-i among the original subsequences which oc- 
curred after kr/-i, and, moreover, each letter must come from a different 
subsequence. But since each original subsequence was increasing, none of 
them could have contained an ae before kr/-i, and we have a contradiction. 

To better understand this construction, consider the first row of Figure [H 
which shows an initial sequence of length n = 12, with m = 4 letters, broken 
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Figure 1: Transformation of r = 3 subsequences. 

into r — 3 disjoint, increasing subsequences of lengths rii = 3, n2 = 4, and 
ris — 3, and so with total length 10. The final three rows of the diagram show 
the results of the operations described above, producing 3 new increasing 
subsequences of length ni = 4, n2 = 3, and = 3. 

Hence, if we wish to find r disjoint, increasing subsequences whose length 
sum is maximal, it suffices to consider only those disjoint, increasing subse- 
quences for which the final occurrence of the letter ae in the subsequence i 
happens after the final occurrence in the subsequence j, whenever i < j. Be- 
cause such ranges do not overlap, if we wish to count the number of aes in the 
J*'* subsequence, it suffices to simply count the number of a^s in {Xk)i<k<n 
over that range. 

Indeed, returning to the fundamental combinatorial objects of our de- 
velopment, the a^, we see that since a-^ — counts the number of a^s in 
the range i + 1, . . . , k, we can describe the valid index ranges over which 
to search for the maximal sum as J^^m = {{kj/,^ < J < t, < £ < m) : 
kjj_i = 0, kj^m-r+j = n,l < j < r; < kj^i, I < j < r,l < i < m; kj^i < 
kj^i^i, 2<j<r,l<i< m}. The constraints on the kj^i follow simply from 
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j=l 1 2 3 4 



j=2 2 3 4 5 



j=3 3 4 5 6 



j=4 4 5 6 7 



k=l k=n 



Figure 2: Schematic diagram of Jr-,m, for r = 4, m = 7. 

the fact that each subsequence is increasing and that, moreover, the intervals 
associated with a given letter do not overlap. Figure [2] indicates the relative 
positions of each range, for r = 4 and m = 7. 

Since the first possible letter of each subsequence grows from ai to a^, 
and the last possible letter grows from am+r-i to a^, the result is proved. ■ 

We are now ready to apply our asymptotic covariance results (Theorem 
I4.ip . along with a Brownian sample-path approximation, to the combinato- 
rial expression fIS.ip . and so obtain a Brownian functional expression for the 
limiting shape of the Young tableau for all irreducible, aperiodic, homoge- 
neous Markov chains. 

Indeed, for each 1 < r < m, let 

r r m—r+j 

V; := E = , max E (<. " ' (5-2) 

3=1 ]=i e=j 

where the index set Jr^m is defined as in Theorem 15. 1[ Define as before 
T[ = Yli=i(Wi ~ ^r) = ctfe ^ '^rk, and so rewrite (15.21) as 



r m—r+j 



= , _max^ E E (K. + ^^^^■.^) - (^4.-1 + ^^^^-.^-^ 

r m—r+j 



max 

j=i i=j 
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Next, let r be a permutation of the indices 1, 2, . . . , m such that 7r^(i) > 
'^t{2) > • ■ ■ > 7i"r(m) > 0. Moreover, we demand that if HT-[i) = tTtU) for 
i < j, then r(i) < r(j). (The permutation so defined is thus unique.) Let 
Ur = X]j=i ^r(j) be the sum of the r largest values among tti, 7r2, . . . , vTm- 
We obtain, below, the limiting distribution of (V^ — Uru) / y/n as a Brownian 
functional. 

To introduce Brownian sample-path approximations, and for each 1 < 
r < m, we first define the asymptotic variance of as in (14.101) . by 

al := hm = e.Ee^, (5.4) 

n^oo n 

and, for ri 7^ r2, the asymptotic covariance of T^^ and T^^ by 



a.,., := hm ^"^^^"'^"^ = e.,Ee^^, (5.5) 

n— »oo n 

where E is the covariance matrix of Theorem 14.11 associated with the transi- 
tion matrix P. For each 1 < r < m, we then let 

ar.\/n 



for < t < 1. This rescaling of [0,n] to [0, 1] calls for us to define a new 
parameter set over which we will maximize a functional arising from the 
expressions in (15.61) . Indeed, for any positive integers s and d, with s < d, 
define the set 

Is,d = |(tj,£, I < j < s,0 < i < d) :tjj„i = 0, tj^d-s+j = 1, 1 < J < s; 

tj,e-i ^ tj/, i ^ j ^ s,l < i < d; 
tj,i<tj.i^e,2<j<s,l<£<dy 

Note that the constraints tjj-i = and tj^d-s+j = 1, for 1 < j < s, force 
many of the tj^e to be zero or one. We will denote the s x [d + l)-tuple 
elements of Is,d, by (t.,.). Figure [3] shows the structure of Is,d, for s = 4 and 
d = 7. The locations of tj^£ are indicated by the horizontal lines within the 
diagram. 

With this notation, (15.31) becomes 
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Figure 3: Schematic diagram of Is,d, for s = 4, ci = 7. 




r m—r+j 



max 

(«.,.)G-fr, 



r m—r+j 



Our analysis of fl5.7l) will yield the following theorem, whose proof we de- 
fer to the conclusion of the section. This theorem gives, in particular, a full 
generalization of the limiting shape of the Young tableau in the non-uniform 
iid case. 

Theorem 5.2 Let {Xn)n>o be an irreducible, aperiodic, homogeneous Markov 
chain with finite state space Am = {^i < ■ ■ ■ < am}, transition matrix P, 
and stationary distribution {7Ci,tt2, . . . ,7!'m)- Let S = {(yr,s)i<r,s<m be the as- 
sociated asymptotic covariance matrix, as given in (14.121) . and let the law 
of Xq be given by the stationary distribution. Let t be the permutation of 
{1,2,..., m} such that Trr(i) > vrT-(j+i), and T{i) < T{j) whenever 'nr{i) = T^T{j) 
and i < j. For each 1 < r < m, let be the sum of the lengths of the first 
r rows of the associated Young tableau, and let v^. = X]j=i^r(j)- Finally, let 
dr be the multiplicity ofT^rir), o,nd let 




29 



Then, for each 1 < r < m, 



^ 1=1 

r-rur {dr+rrir-r+j) 

+ max ^ ^ (B<^r+^'> (t,,,) - (t,- 

J-r — rrir .dr . . . ^ 



(5.8) 



where the first sum on the right-hand side of f l5.8p zs understood to be 0, if 
rrir = 0. Above, = ar,r, o-nd {B^{t), B'^{t), . . . , B"^{t)) is an m- dimensional 
Brownian motion, with covariance matrix S = {o^r,s)i<r,s<m given by 

(cTr-.s) = t{crr,s)/(TrCrs, (5.9) 

for 1 < r, s < m. Moreover, for any 1 < k < m. 

Remark 5.1 The critical indices dr and rrir in Theorem \5.2\ are chosen so 
that 

Thus, the functional in (15. Sp consists of a sum of rrir Gaussian random vari- 
ables and a maximal functional involving only dr of the m one- dimensional 
Brownian motions. 

Remark 5.2 Another, more natural, way of describing the covariance struc- 
ture of the m- dimensional Brownian motion in Theorem \5.2\ is to note that 
{criB^{t), cr2-B^(t), . . . , (JmB^{t)) has covariance matrix tT,. 

Let us now examine the case r = 1. Here, as previously noted, = Lin. 
Since mi = 0, (15. 8p becomes 



Lin '^max^ 



Zp^^ n,ax Va.(,)(s^W(ti,,)-5-W(ti,_i)), (5.11) 
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where we have written vr^ax for 71^.(1) . The functional in flS.lip is similar to 
the one obtained in the iid case in [IB], the essential difference being, not in 
the form of the Brownian functional, but rather in the covariance structure 
of the Brownian motions. 

To see precisely where this difference comes into play, note that if the tran- 
sition matrix P is cyclic, then the covariance matrix of the Brownian motion 
is also cyclic. Consider then the 3-letter aperiodic, homogeneous, doubly- 
stochastic Markov case. Since the Brownian covariance matrix is symmetric, 
and, moreover, degenerate, an additional cyclicity constraint forces it to have 
the permutation-symmetric structure seen in the iid uniform case. In partic- 
ular, Lin will have, up to a scaling factor, the same limiting distribution as 
in the iid uniform case: 



Lin - n/3 



n 



a max 

(i.,.)6/i.3 



(5.12) 



where a = ai, for all 1 < £ < 3, and with the Brownian covariance matrix 
given by 

/ 1 -1/2 -l/2\ 
S = t -1/2 1 -1/2 , 
V-1/2 -1/2 1 / 

and where we have used the fact that r(£) = £, for all 1 < £ < 3. 

However, when m > 4, the cyclicity constraint does not force the Brow- 
nian covariance matrix to have the permutation-symmetric structure, as the 
following example shows for m = 4. 

Example 5.1 Consider the following doubly-stochastic, aperiodic, cyclic tran- 
sition matrix: 



P 



/0.4 
0.1 
0.2 

\0.3 



0.3 
0.4 
0.1 
0.2 



0.2 
0.3 
0.4 
0.1 



0.1\ 
0.2 
0.3 
0.4/ 



(5.13) 



While the doubly-stochastic nature of P ensures that the stationary distribu- 
tion is uniform, the covariance matrix of the limiting Brownian motion, at 
three- decimal accuracy, is computed to be 
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/ 1.000 -0.357 -0.287 -0.357\ 

~_ -0.357 1.000 -0.357 -0.287 , . 

-0.287 -0.357 1.000 -0.357 ' ^ ' 

\-0.357 -0.287 -0.357 1.000 / 

and al = cr^ := 0.263, for each 1 < r < 4. Thus, the limiting distribution of 
Lin is given by 



" => CT max y2(B\t,,)-B\t,,_S). (5-15) 

(i.,.)6/i,4 "j-^.y J 

for 1 < r < 4. However, while the form of the functional is the same as 
in the iid uniform case (up to the constant), the covariance structure of the 
Brownian motion in (15.141) differs from that of the uniform iid case, i.e., 
from 





1 


-1/3 


-1/3 


-l/3\ 




-1/3 


1 


-1/3 


-1/3 




-1/3 


-1/3 


1 


-1/3 


V- 


-1/3 


-1/3 


-1/3 


1 / 



(5.16) 



and so the limiting distribution in (I5.15P is not that of the uniform iid case. 



We thus see that Kuperberg's conjecture regarding the shape of the Young 
tableau for random sequences generated by aperiodic, homogeneous, and 
cychc matrices [24] is not true for general m-alphabets. By simply extending 
the first-row analysis above to the second and third rows, we see that it is 
true for m = 3. However, as could have been anticipated by (15 . 1 21) . it fails 
for m > 4, as the previous example showed. Furthermore, in the next section 
we shall see that for the cyclic case the structure of S can be described in an 
elegant manner which more clearly delineates when we obtain the uniform 
iid limiting law. 

In the more general doubly stochastic case, we have the following corol- 
lary: 

Corollary 5.1 Let the transition matrix P of Theorem \5.S\ be doubly stochas- 
tic. Then, for every 1 < r < m, rrir = 0, dr = m, and 
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-\/r _ I m-r+j 

" r ^ max {B'ih^) - B\h^-^)) ■ (5-17) 

If, moreover, the matrix P has all entries ofl/m (i.e., in the iid uniform 
alphabet case), then 



V^ — rn/m \Jm — \ 



r m—r+j 



„ E (S'M - ■ (5.18) 

j=l £=j 

and the covariance matrix in (15.91) has all its off-diagonals equal to — l/(m — 
l). 

Proof. For each 1 < r < m, tt^ = 1/m, and so = r/m, rrir = 0, and 
the multiphcity dr = m. Moreover, the permutation r is simply the identity 
permutation. This proves (15.171) . If, moreover, all the transition probabili- 
ties are 1/m, then the multinomial nature of the underlying combinatorial 
quantities a^. tells us that = (1/m) (1 — 1/m), for each 1 < r < m, and 
that Pri,r2 = — l/(m — 1), for each ri ^ r2, thus proving (I5.18p . ■ 

To see that the functional in (15.171) is generally different from the uniform 
iid case, even for m = 3, consider the following non-cyclic example: 

Example 5.2 Let a doubly- stochastic (but non-cyclic), aperiodic Markov 
chain have transition matrix 



(5.19) 

As in Example \5.1\ the doubly-stochastic nature of P ensures that the sta- 
tionary distribution is uniform. In the present example, the asymptotic co- 
variance matrix, at three- decimal accuracy, is computed to be 

0.459 0.049 -0.506^ 

0.049 0.086 -0.136 | . (5.20) 
-0.506 -0.136 0.642 
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Note that, even though we have a uniform stationary distribution, the asymp- 
totic variances (i.e., the diagonals of fl5.20p ) have dramatically different val- 
ues. Moreover, according to Remark \4.S\ in the uniform iid case, the only 
possibility for the Brownian covariance matrix is that the off- diagonals have 
value —1/2. However, the Brownian motion covariance matrix obtained from 
(^M) IS 

/ 1.000 0.246 -0.935\ 
t 0.246 1.000 -0.577 . (5.21) 
\-0.935 -0.577 1.000 / 

Not only are the off-diagonals different from —1/2, but in some cases are even 
positive. In short, the functional in (15.1 7p has a distribution which differs 
from any iid case (even non-uniform). 

Remark 5.3 Generalizing a result of Baryshnikov ^ and of Gravner, Tracy, 
and Widom fW^ on the representation of the maximal eigenvalue of anmxm 
element of the GUE, Doumerc \TW found a Brownian functional expression 
for all the eigenvalues of an m x m element of the GUE. Our expression 
in (I5.18P is similar, with the exception that our m- dimensional Brownian 
motion is constrained by a zero-sum condition, and, moreover, has a dif- 
ferent covariance structure. (We note, moreover, that the parameters over 
which his Brownian functional is maximized in [W^ might be intended to 
range over a slightly larger set which corresponds to our Ir,m-) Using a path- 
transformation technique relating the joint distribution of a certain transfor- 
mation of n continuous processes to the joint distribution of the processes 
conditioned never to leave the Weyl chamber, O'Connell and Yor l2Ef em- 
ployed queuing-theoretic arguments to obtain Brownian functional represen- 
tations for the entire spectrum of the m x m element of the GUE. In a study 
of much more general transformations of this type, Bougerol and Jeulin ^ 
were able to obtain this result as a special case. 

If dr = 1, i.e., if the r^^ most probable state is unique, then the following 
result can be viewed as lying at the other extreme from Corollary 15.11 

Corollary 5.2 Let 1 < r < m, and let dr = I in Theorem \5.2[ Then 



n . ^ 

1=1 



5^a.(,)fi^«(l). (5.22) 
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Proof. If dr = 1, then = r — 1, and so the maximal term of fl5.8l) contains 
only one summand, namely ar{mr+i)B'^^'^''^^K'^) = <^r(r)-B'^^''''(l). Including 
this term in the first summation term of (15.81) proves (I5.22p . ■ 

Remark 5.4 The maximal term of the functional in (15.81) is that of the 
doubly-stochastic, d^-letter case. Indeed, the maximal term involves precisely 
dj. Brownian motions over the r — rows. Such a functional would arise in 
a doubly- stochastic dr-letter situation with a covariance matrix consisting of 
the sub-matrix of the original S corresponding to the dr Brownian motions, 
as in Corollary \5.1[ The Gaussian term corresponds to the functional of 
Corollaru \5.B . That is, in some sense, the limiting law of ( \5.8\} interpolates 
between these two extreme cases. 



Proof. (Theorem 15. 2p Since the r = 
cally equal to n), assume that r < m. 



m case is trivial {V^ is then identi- 
Recall the approximating functional 



r m—r+j ^ 
j=l £=j J 

Introducing the notation Atj^i := [tj^i_i,tj^i_i] and M^(Atj^^) := M^^itj^i) — 
M^{tj^e-i), for any m-dimensional process M{t) = (M^(t), M^{t), M'^lt)), 
t G [0, 1], we can rewrite (15.231) more compactly as 

yr ^ ^ r ^ m—r+j r m—r+j >. 

^S=^ = 7^^ E E ^^BiiAt,,,) - E i^rU)-ne)\At^,A- 

V '<j = l i=j j = l i=j } 

(5.24) 

The main idea of the proof to follow will be to show that the second 
summation of (I5.24p can, in effect, be eliminated by choosing the (Atj^^) 
in an appropriate manner. Now some of the coefficients (7rT-(j)_7rJ are zero; 
such terms do not cause any problems. Intuitively, however, the remaining 
terms should have |Atj^^| = 0. Defining the restricted set of parameters 
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^r,m = ii'tj/) e Ir,m ■ Ei=i YlT=i^\^i " ^r(i) ) | Atj- £ | = 0, 1 < £ < m}, we See 
that, provided 1*^7^^, 



r m—r+j 



j=i i=j 

r m—r+j 

>max^ E (5.25) 

Moreover, by the Invariance Principle and the Continuous Mapping The- 
orem, 

r m—r+j r m—r+j 

max^ Y (TeBiiAtjj) ^ maxY Y (^eB^Atj^e)- (5.26) 
We claim that, indeed, I*^ ^ 0, and that, moreover. 



max^ E (aiBi{Atj^e) - Vn (tt^o) - tt^) \ Atj^e\] 
j=i i=j ^ ^ 

r m—r-j-j 

^max^ Y creB^{Atj,e). (5.27) 

We will prove that 7*,^ 7^ by creating a bijection between and Ir-mr,dr- 
To this end, for 1 < i < m^, let lT(i),i = [uT{i),i-i,Ur{i),i] = [0,1]. Next, 
choose any (m.^.) G Ir-mr,dr^ ^"^^ define further intervals Ir{mr+j),t = ^'Uj/, 
for 1 < j < r — rrir and 1 < i < dr. 

We now create a partition of these intervals in a manner which relies 
on the ideas used in the proof of Theorem 15. 1[ Consider the set of points 
{ujA (i<j<r-mr,i<£<dr), and order them as Sq := < Si < • ■ ■ < s^-i < : = 
1, for some integer k, and let Asg = [sg-i, Sg], for all 1 < g < k. 

Trivially, for each 1 < q < and for each 1 < i < rrir, Asq C lT(i),i- 
Moreover, for each 1 < j < r — m^, there exists a unique £(j, q) such 
that Asq C lT{mr+j),(.{j,q)- Fo^ 6ach g, consider the set of indices Aq := 
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{r(l), . . . , T^nir)} U {r(mr + £(1, g)), . . . , r(mr + £{r — m^, (?))}, and order 
these r elements of Ag as 1 < £{1, q) < ■ ■ ■ < £{r, q) < m. 

Using these partitions, we examine, with foresight, the following func- 
tional of a general m-dimensional process {M{t))t>o'- 



rur (r-irir) {i — rra,.+rfr — 1) 

^Ar»(l)+ J2 Yl M^^"''-+^\Au,,^) (5.28) 

1=1 j=l £=j 



rrir I K 



i=\ \q=\ 

(r—nir) (l — mr+dr~l) 

K / rrir [r-rrir) 

<?=! yi=i j=i 

9=1 j = l j = l q=l 

= EE^'^'"'^(At,/), (5.29) 
j=i i=i 

where, for each 1 < j < t, and for each 1 < £ < m, tj^i := max{sg : £ > 
^(i; ?)}• (That is, for each j, we collapse together intervals Asg corresponding 
to the same component M^.) Now, since our functional in (15.291) has non- 
trivial summands only for £ such that 7rT-(f) > 7r^(r), we have shown that 

The following example illustrates this argument. Suppose we have an 
alphabet of size m = 8, with 

(tti, tts, . . . , TTg) = (0.07, 0.1, 0.2, 0.06, 0.2, 0.06, 0.1, 0.2). 

Then, 

7rr(i) = 71t(2) = 71t(3) = 0.2, nil = m2 = nis = 0, di = d2 = = 3, 
7rr(4) = 7r^(5) = 0.1, mi = m^ = 3, di = d^ = 2, 
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1 


2 


3 


4 


5 


2 


3 


4 


5 


6 


3 


4 


5 


6 


7 


4 


5 


6 


7 


8 



Figure 4: A typical element of I^^s- 

7Tr(6) = 0.07, rriQ = 5, 4 = 1, 

7r^(7) = 7rT-(8) = 0.06, rriY = mg = 6, dr = ds = 2. 

In particular, note that the two largest, distinct probability values are 
0.2 and 0.1, of multiplicities 3 and 2, respectively. Next, consider the case 
r = 4. We now show how Ir-mr,dr = -^4-3,2 = h,2 corresponds to an element 
of J*^ = II g. Figure m shows a typical element of the unconstrained index 
set I^^s- 

Now r(l) = 3,r(2) = 5,r(3) = 8,r(4) = 2, and r(5) = 7. Our construc- 
tion begins with the amalgamation of = m4 = 3 rows, corresponding to 
the three indices for which vTj is strictly less than 'n'r(r) = T^riA) = 0.1, with 
Ji^2- This is shown in Figure O 

Finally, we simply reorder each vertical column in the original order of the 
indices, as shown in Figure [61 We see that, first of all, we have constructed 
an element of /4^8- Moreover, since we have three rows whose indices are 
associated with the maximum value, and a remaining row whose indices 
are associated with 7Tr{A), we indeed have an element of I^^. Note that the 
4 X 4 = 16 free indices in I^^s (corresponding to the locations of the 16 vertical 
bars in Figure H]) have been reduced to a single one in I^^. 

In addition, we may essentially reverse this construction, starting with 
an element of /*^ (7^ 0), and so obtain an element of Ir-mr,dr- Indeed, from 
the definitions of I*^ and z/^ we know that 
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Figure 5: Amalgamating 3 rows with Ii^2- 





Figure 6: Reordering vertically to obtain an element in 
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r r m—r-\-j 

j=i j=i e=j 

for any (t,^,) G J*^. However, we also have 

r m—r+j r m—r+j 

j=i e=j j=i e=j 

r m—r+j 

+ Y '^{^T(i)<^r(mr)}'^i\^tj/\ 

j=i e=j 

r m—r+j 
+ l{mr=0}T^r{l)Y Y l^^i'^l 

j=l e=j 

< l{m,>0}((7rr(l) H h 7r^(mO) + {r - mr)7Tr(r)) 

+ l{m,=0}'^7r^(l) 
= ^r, 

with equahty holding throughout if and only if = or rrir > and 
Yl]=i l^^i.^l ~ ^ such that TCr{e) > '^T{mr)j ^"^^ that, moreover, 

Z]j=i YJi=j"^^ l{7r,(Q=7r,(, )}|At j-£| = r - m^. If > 0, then, for any (t. J e 
J*^, we may start with (15.291) . and use again the permutation of the indices 
employed there. We thus obtain the first term of fl5.28p . which corresponds 
to the condition Yl^j=i l-^^j/l = 1; ^ such that 7r^(£) > i^rimr)^ ^-iid 

also the second term of (15.281) . which corresponds to the other condition 

E5=iE^7^^ ■'-{'^-w='^-w}I^^J'^l ^ r-nir- If = the same reasoning 
holds, except that the first term in (I5.28P is taken to be zero. 

Having thus established a bijection between I*^ and Ir-mr,dri we may thus 
maximize over these two parameter sets, and so, for any process {M{t))t>Qi 
obtain the general result 

mr (r-mr) (r-mr+dr-l) 

Xm^»(1)+ max Y ^"^"''^^n^M 

i = l Ir-^r.dr .^^ 

r m—r+j 

= maxX J2 M^^^^^\Atj^e)- (5.30) 
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We now proceed to show that (I5.27P holds. First, fix c > 0, and, for each 
1 < ^ < m, set 

ce=h if^^<^^«' (5.31) 
I 0, otherwise. 

Next, let Ml{t) = (jSi{t) - cet, and let M^{t) = a^B^t) - Qt. Then, 
for n large enough, namely, for n > c/{Tir{r) — T^rir+i))^ "we have that, almost 
surely, for any t,^. G /r,m, 



r m—r+j 

j=i 1=1 

r m—r+j 

>J2Y1 (^^^n(At,/)-v^(7r,(,)-7r,)|At,-,|). (5.32) 
Hence, almost surely, both 

r m—r+j 



-* r.m 

j=i 1=1 

r m—r+j 

> max^ ^ (aiB^{Atj^i) - y/n [11^^) - iri) \Atj^e\] , (5.33) 



and 



r m—r+j r m—r+j 



Now choose any z > 0. Then 
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. r m—r+j 

P( max (aeB^^{Atj^e) - ^ (7r^(j) - vr^) | Atj,^ 

\ i-r.ra V 

r m— r+j r m— r+j 

< P I maxj^ 5^ ii? (At,,,) - max^ ^ M^(At,,,) > z ) , (5.35) 



i = l £=1 ' j = l £=1 



SO that 



limsupPi max (aiB^{Atj^i) — \fn (vr^(j) — vr^) |Atj, 

^ J = l £=J 

r m— r+j n 

-max^ 5Z ^^^n(Asi,£) > 2;) 

(r m—r+j r m—r+j 

max^ 5Z ^"(Aij/) -max^ ^ M^(Atj-£) > z J 
j=i 1=1 j=i t=i ' 

r m—r+j r m—r+j \ 

max^ Y M\Atji) - maj^Y ^ M\Atj^t)> z \ , (5.36) 
j=i 1=1 ^ 3=1 i=i ' J 

by the Invariance Principle and the Continuous Mapping Theorem. Next, 
for any < £ < 1, let 

Ir,mi£) = {{tj,e) e Ir;m ■ ^ \Atj^(,\l{^^^^^^^^} < £r} . 
Thus, /*„j = /r,m(0) C Ir,m{£) C Ir,miX) = Ir,m- We bouud (I5.36P USiug this 

family of subsets as follows: 



P 



r m—r+j r m—r+j 

j=i i=i j=i e=i 
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(r m—r+j r m—r+j 

J=l ^=1 J = l £=1 

(r m—r+j r m—r+j 

lr.m\ir,m(S.) . -'r.m . -, , 

J = l £=1 J = l £=1 

r m—r+j r m—r+j 



(r m—r+j r m—r+j 

max E E B^{Atj^e) -maxY^ ^ B^Asj^e) > z + ere 
Ir.mVrMe) .^^ £=1 ' j = l i=l 

(r m—r+j r m—r+j 

max E Y B^iAtj^e) -maxY Yl B^Asj^e) > 
J = l £=1 J = l £=1 

(r m—r+j r m—r+j ^ 

max^ Y B^iAtj^e) -rnaxY Y B^Asj^e) > z + ercY 
j=l £=\ j = \ £=1 / 

(5.37) 

We can now take the limsup in (15.371) . as c — oo, and then, as £ ^ 0, 
and so estabhsh convergence to zero in probabihty. Moreover, since 



r m—r+j r m—r+j 



P 



max^ Y M\Atj^e)-maxY Y ^^(^^j/) > = 1, 



j=l £=1 • j=l £=1 



we have in fact shown, with the help of (15.361) . that with probabihty one, 

r m—r+j r m—r+j 



max^ Y M\Atj,i) = m.ax.Y Y ^^(^^i.^ 

and thus 



j=l £=1 j=l £=1 



max Y \(^£Bn{Atj^f) - ^ (7r^(j) - tt^) \AtjM 

J-r.m \ / 

j=l £=j 

r m—r+j 

-max^ Y (TeBiiAs^^e) ^ 0. (5.38) 
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Since 



r m—r+j r m—r+j 

maxj^ S (^eB'jX^Sj/) ^ max^ ^ ai^B^Asj^i), (5.39) 

' 

by the Converging Together Lemma, we have proved (15.271) . Equation (15.81) 
of the theorem follows from the bijection between I*^ and Ir-mr,dr described 
in the general result (I5.3UI) . 

Finally, we can obtain the convergence of the joint distribution in (15.101) 
in the following manner. Given any {61,62, ... ,6r) G M*", we have 



k=l 



k 

I ^ 1 

k 



r y k m—k+j 

^ 6k ( max ^ ^ (creBi{Atj^i) - y/n [nr^j) - ire) \ Atj^i\ ] 
k=i ^^"-"^ j=i i=j ^ ■ 

1" / k m—k+j 

^ 6k ( inax ^ ^ (aiB^iAtj^e) - Vn (tt^q) - tt^) | At^- J 
k=i ^ j=i e=j 

k m-k+j \ ^ / ^ m—k+j s 

7 = 1 i=j ' k=l ^ 7=1 £=7 ^ 



Now from (I5.38p . the first summation on the right-hand side of (I5.40p con- 
verges to zero in probability, as — > cxd. Moreover, the second summation is 
a continuous functional of (5^, B'^, . . . , B^), and so, by the Invariance Prin- 
ciple and Continuous Mapping Theorem, converges. Then the Converging 
Together Lemma, along with the bijection result (15.301) . gives 



f / k m—k+j s r 

^J^^JmaxJ^ a,B\As,A =Y,(^kV^- (5-41) 

k=l ^ j=l £=j ^ k=l 
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Since f l5.4ip holds for arbitrary {9i, 62, ... , Or) G W , by the Cramer- Wold 
Theorem, we have the joint convergence result (IS.lOp . ■ 

Since the shape of the Young tableau is more naturally expressed in terms 
of the B!^, rather than of the V^'^, we may restate the results of the previous 
theorem as follows: 

Theorem 5.3 Let (X„)„>o he an irreducible, aperiodic, homogeneous Markov 
chain with finite state space Am = {«! < ■ ■ ■ < and with stationary 

distribution (tti, 7T2, . . . , TTm)- Then, in the notations of Theorem \5.Sl 

f Rl-7Trii)n Rl-Trr(2)n R^^7r^^\ 

(5.42) 

where 

di 

Rl = max f^^^'H^i/) " B^'^'\h^e-i)) , (5.43) 

and, for each 2 < k < m, 

i=mfc_i+l 

k-mk {dk+rrik-k+j) 

+ max E ^ri^k^i)B^^"''-''\^t,,e) 

J = l i=J 

fc-l-mfe_i (dfe_i+mfc_i-fe+l+j) 



E E a.(.,_,+.)5^("-^+^)(At,, 

(5.44) 



max 2^ 2^ ^ --^ ■ '^ZAtj-^;, 

11,111,1 J — I 



where we use the notation B^{Atj^e) = B'^{tj^e) — B''^{tj£^i), for any 1 < s < 
m, 1 < j < k, and 1 < i < m, and where the first sum on the right-hand 
side of (15.441) is understood to be 0, if ruk = mk-i- 
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Proof. First, i?^ = V^, and, for each 2 < k < m, = - ^ 
Expressing these equahties at the multivariate level, we have 



m) 



n 



n \ n \/n 



n y/n y/n 

yfn ' " ' ' yfn 

(Ki,\/l,...,O-(0,Ki,---,O 

(5.45) 



where the weak convergence follows immediately from the Continuous Map- 
ping Theorem, since the transformation is linear. 

Equations (15.431) and fl5.44p follow simply from the Brownian expressions 
for (1/^,1/^,..., V^) in Theorem [E2 ■ 

If all m letters have unique stationary probabilities, then we have the 
following corollary to Theorem 15.31 

Corollary 5.3 // the stationary distribution of Theorem \5.3\ is such that 
each Tir is unique, then 



Rl - 77^(1)^1 Rl - 7lr(2)n Rn - T^T{m)n 



n \ n \ n 



iV((0,0,...,0),S). 

(5.46) 



In other words, the limiting distribution is identical in law to the spectrum 
of the diagonal matrix D = diag{Zi, Z2, . . . , Z^}, where {Zi, Z2, . . . , Z^) is 
a centered normal random vector with covariance matrix S. 

Proof. Now, for all 1 < k < m, dk = 1, and = A; — 1, so that 
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= a,(i)5^W(l), 
and, for each 2 < k < m, 



mfc 



i=TOfc_l + l 

k-ruk {dk+ruk-k+j) 

+ max J] ^r(m,+i)B^^'^'^'\Atj,e) 

k-l-m^_i {d^_i+mk-i-k+l+j) 

- max Yl E ar^m,_,+e)B<-^-^+'\At,,e) 

i=fc-l 

1 i 

+ max V V a,(,_i+,)5-(^-i+^) (At,,,) 

^1.1 — 1 1 — 
j=i e=j 

-rns^YY ^rik-2^e)B^^'-'^'^ (At,,,) 

= a,(fc_i)B-('=-i)(l) + a,(fc)5^W(l) - (7,(fc_i)5^(^^-i)(l) 
= a.(fe)5-('=)(l)- 

Moreover, the joint law resuh for {Rj^, i?^, . . . , K^) holds as well, and this 
is clearly a multivariate normal distribution, with mean (0,0,..., 0) and co- 
variance matrix E. Since the spectrum of a diagonal matrix consists of its 
diagonal elements, the final claim of the corollary holds. ■ 

Remcirk 5.5 We know that the joint law of (i?^, -R^, . . . , R^) in the iid 
uniform alphabet case is identical to the joint law of the eigenvalues of an 
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m X m traceless GUE matrix. Corollary 15.51 also gives a spectral charac- 
terization for the unique probability case, in particular, for a non-uniform 
ad alphabet with unique stationary probabilities. This is consistent with the 
characterization of the limiting law of Lin in the non-uniform iid case, due 
to Its, Tracy, and Widom fW[ l21^ . as that of the largest eigenvalue of the 
block associated with the most probable letters among a direct sum of inde- 
pendent GUE matrices whose dimensions correspond to the multiplicities dr 
of Theorems 15.^1 and \5.3[ subject to the condition that YlT=i ^/'^T{r)Xr = 0, 
where Xi, X2, . . . , are the diagonal elements of the random matrix. 

Remark 5.6 The difference between the zero-trace condition Yl^=i = 
and the generalized traceless condition YlT=i y/'^T{r)Xr = amounts to noth- 
ing more than a difference in the choice of scaling for each row K'^. We will 
find it more natural to express our results using the normalization associated 
with the zero-trace condition YlT=i -^r = 

6 Fine Structure of the Brownian Functional 

So far, we have seen that the hmiting shape of the random Young tableau 
generated by an aperiodic, irreducible, homogeneous Markov chain can be 
expressed as a Brownian functional. The form of this functional is similar to 
the iid case; the only difference is in the covariance structure of the Brownian 
motion. We begin our study of the consequences of this difference. 

In the iid uniform m-alphabet case, Johansson [22] proved that the limit- 
ing shape of the Young tableau had a joint law which is that of the spectrum 
of an m X m traceless GUE matrix. An immediate consequence of this result 
is that the limiting shape of the Young tableau contains simple symmetries, 
e.g., for each 1 < r < m, R"^ = —K^"^. Now, as was seen in Corollary 15. II of 
Theorem 15.21 the form of the Brownian functional in the doubly stochastic 
case involved only the maximal term. We will see that that there is also a 
pleasing symmetry to the limiting shape of Young tableaux in the doubly 
stochastic case by examining a natural bijection between the parameter set 
Ir,m and Im-r,m, foT any 1 < r < m — 1. Indeed, this result will follow as a 
corollary to the following, more general, theorem: 

Theorem 6.1 The limiting functionals of Theorem \5.2\ enjoy the following 
symmetry property: for every 1 < r < m — 1, 
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i=l 



r-rrir (rrir+dr-r+j) 



+ 



max 



m 



C 



+ 




r—rrir+j 



J2 cr,im.+i)B-^"'^^'\Au,,,), (6.1) 



where B^{A) := B^{t) — B^{s), for A = [s,t], and where the non-maximal 
terms on the left and right-hand sides of (16.11) are identically zero ifnir = 0, 
or rur + dr = m, respectively. 

Remark 6.1 Recall that, from the definitions ofrrir anddr, the non-maximal 
summation terms on the left and right-hand sides of (16.11) reflect the letters 
which have, respectively, greater and smaller stationary probabilities than 
7Tr{r)- Rccall, morcovcr, that the maximal terms are associated with the in- 
dices having the same stationary probability as T^T{r) ■ The maximal term on 
the left-hand side of (16.11) involves a summation over r — rrij. rows, while the 
one on the right-hand side involves m^+i — r rows. Thus, in a sense, the two 
maximal terms in (16.11) split dr = rrir+i — rrir rows between themselves. In 
summary, the functional on the right-hand side of (16.11) corresponds to the 
sum of the m — r bottom rows of the Young tableau. 

Proof. Without loss of generality, we may assume that r(j) = j, for all 
1 < j < m. Fix 1 < r < m — 1, and for any point t in the index set 

Ir-mr,dr, define Atj+rnr,e = [tj/-i,tj,e]y for 1 < j < r - and 1 < £ < dr. 
Furthermore, for each 1 < j < rrir or m^+i < j < m, set Atj £ = [0, 1], for 
j = i, Atj^i = {0}, for < £ < j, and At^-^ = {1}, for j < £ < m. Next, as 
in the proof of Theorem 15. 2[ consider the set of points {tj/}{i<j<r-mr,i<i<dr), 
and order them as Sq := < Si < ■ ■ ■ < s^-i < := 1, for some integer k, 
and let Asq = [sg-i, Sq], for each I < q < k. 

Now, for each 1 < g < k, let Aq consist of the indices £ for which As^ fl 
Atj^i 7^ 0. Then, almost surely. 
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nir r—rrir {rrir+dr—r+j) 

i=l j=l i=j 

T rn 

3=1 e=i 

r K m 

= EEE^^^'(^*i.«^^^«) 

j=i 5=1 e=i 

= EEE^^^'(^^'^)- (6-2) 

j=l q=l i^Aq 

Now by the "stairstep" properties of /r,m there are precisely r elements in 
each Aq. Letting Aq = {1, . . . , m} \ Aq, for each 1 < g < we thus see that 
each Aq contains exactly m — r elements. Let ij^q be the j^^ smallest element 
of Ag. We claim that for each 1 < j < m — r, the sequence £j^i, £j^2, ■ ■ ■ , £j,K- 
is weakly decreasing. 

Indeed, fix 1 < j < m — r and 1 < g < /t — 1, and suppose that q is 
less than all the elements of Aq. Then, by the properties of Ir,mi the least 
element of Aq^i is no smaller, so that the j*'* smallest element of Aq, ij,q+i 
is also ij^q. Next, suppose that ij^g is greater than k > 1 elements of Ag. 
Thus, £j_q = j + k. Then there are at most k elements of Ag^i which are less 
than or equal to ij_g, by the properties of 7^,^. But this implies that there 
are at least j elements of Ag^i which are less than or equal to ij^g. Thus, 
^j,q+i < ^j,q, and the claim is proved. 

Moreover, since each Ag contains {1,2,..., m^}, we see that necessarily 
each Ag contains {m,, + rf^ + 1, rrir + dr + 2, . . . , m}. 

For each 1 < j < m — r, we may now amalgamate the intervals Asg to 
obtain a partition of the unit interval. Specifically, for each 1 < j < m — r, 
and each 1 < £ < m, let uj^i be the smallest Sg such that ij,q+i < i. (We 
define ujfi = 1, for all 1 < j < m — r.) 

Finally, and most crucially, recall that YlT^i'^^-^^i^) ~ ^' ^'-'^ ^- Then 
since {B\ B^,..., B"") = {-B\ -B^, -B""), 

±±Y^a,B\Asq) 

j=i q=i eeAg 
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m—r K 

j = l g=l ^g^^ 

m mr+dr—r m 

i=mr+dr + l j = l (.= 1 

m mr+dr—r m 

^ Y ^^B\l)+ 5^an^.+.^-'■+^(A«,,,), (6.3) 

where Am^^ = [uj^i^i,Uj^i]. But, by the way we ordered each Ag, we must 
have Auj^^e fl Auj^/ = 0, for any ji ^ j2- Thus, u G /m,+d,-r,d,, and so we 
may restrict the summation over i in (16.31) to £ = j, . . . , r — rrir + j, since 
the remaining terms are zero. Equation (16. ip follows immediately by taking 
the maxima over Ir-mr,dr and Imr+dr-r,dr over the left-hand and right-hand 
sides, respectively, of (16.31) . ■ 

For doubly stochastic transition matrices, the symmetry is even more 
apparent: 

Corollary 6.1 Let the transition matrix P of Theorem \5.S\ be doubly stochas- 
tic. Then, for every 1 < r < m — 1, 



t(-,-)Glr,m . . ^ ' 

m—r r+j 

. EE (^'("^•.^) - B'{u,,i-i)) := V^-^, (6.4) 



i=i e=j 



and so 



Moreover, 

(Ki,---,0 = (K?-\---,^c^-'^)- (6.6) 
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Proof. Since rrir = and = m for all 1 < r < m, the non-maximal terms 
on both sides of fl6.1l) disappear, and we have fl6.4p . 

To prove fl6.5p . recall that = Yl^=i^n = Then, from the result 
just proved. 



T/'m—r 



(m — r)n/m 



Y.T=i Rii - (m- r)n/m 



n 



n 



n - E7=m-r+i Ri)-{T^- r)n/m 



n 



rn/m - YJl-^-^^^ Rl 



^j=m—r+l n 



n 



•frm—r ±^ jrr 



(6.7) 



and we have established the claimed symmetry. 

Finally, the extension of (16. 4p to (16. 6p follows from a standard Cramer- 
Wold argument. ■ 

Turning again to the cyclic case, recall that, for m > 4, the limiting shape 
of the Young tableau in general differs from that of the iid uniform case. The 
following theorem characterizes the asymptotic covariance matrices of such 
Markov chains. 



Theorem 6.2 Let P be the m x m transition matrix of an aperiodic, irre- 
ducible, cyclic Markov chain on an m-letter, ordered alphabet. Am = {cn < 
a2 < ■ ■ ■ < am}, with 



( a\ 

a2 



ai 



as 



02 \ 

as 



(6.8) 



(^m—l 

Then, for I < j < m, Xj 
where uj = exp{27ri/m) is the m*^ principal root of unity. Moreover, let- 
ting 7j = Aj/(1 — Xj), for 2 < j < m, and j3j = cos{2nj/m), for < j < m. 



Em 
k=l 



02 ai y 

(^-1)0-1) j^g eigenvalue of P, 
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the asymptotic covariance matrix E is given by: 



For m = 2mo + 1, 

S = ^M(i) + A E ^e(7,)M(^), (6.9) 

i=2 

and /or m = 2mo, 

— 1 A 7"". 9 

S = ^M(^) + 4 E ^e(7.-)M(^) + 47mo+iM(-°+^), (6.10) 

where M^^^ is an m x m Toeplitz matrix with entries {M^^^)k,e — f3(j-i)\k-e\, 
for 2<j<m, and {M^^\,t = 4,^ - (1 - 4,^)/(m - 1), for j^l. 

Proof. It is straightforward, and classical, to verify that, for each I < j < m, 
{l,uj^~^,Lij'^^^~^\ . . . ,u;^'^~^^'^~^^) is a left eigenvector of P, with eigenvalue 
Xj = YlT=i 0'k^^''~^^^''~^^ ■ We can thus write our standard diagonalization of 
P as P — S~^AS, where A = diag{l, A2, . . . , A^), 



and 





(I 


1 




1 


1 \ 






1 














1 








^2(m-l) 






V 




^2(m-l) 










(I 


1 






1 1 


\ 




1 








• • • a;-(™-^) 


1 








. . . ^-2(m-l) 


m 
















V 


^-('«- 


1) u;-'^irn- 


1) 


... a;-(™-i)7 



(6.12) 



In the present cyclic, and hence, doubly stochastic case, we know that E = 
(1/to)(/ + S'^DS + (S'-'^DS)'^), where, as usual, D = diag('yi, 72, ... , 7m) 
= diag{— 1/2, A2/(l— A2), . . . , Am/(1— A^)). We can then compute the entries 
of S-^DS as follows: 
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k,e 

k,i 
m 

^ m 

k=l 

=;!;f-i+i:^'-''"''""')- (6-13) 

\ k=2 / 

for all 1 < ji,j2, < The entries of the asymptotic covariance matrix can 
thus be written as 



f 1 / 



\ \ k=2 

" (6-14) 



A:=2 

for all 1 < ji,j2, < "tTT-- 

Next, note that since Xm+2-k = ^k, we have ■ym+2-k = 7fe, for all 2 < 
k <m. Moreover, since /?|j2-ji|(fc-i) = /9|j2-ji|({m+2-fc)-i), we can write (16.141) 
more symmetrically as (16.91) or (I6.10p . depending on whether m is odd or 
even, respectively, and in the latter case, we also use that 7mo+i is real, since 
cu'"» = -1. ■ 

Let us again examine the cases m = 3 and m = 4. In the former case, we 
have 



But for m = 3, A = -1/2 = /52, and so M^^) = M^^\ Hence 



1 


-1/2 


-1/2 


1/2 


1 


-1/2 


1/2 


-1/2 
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E = ^M(^) + ^i?e(72)M(2) = ^(1 + 2i?e(72))M(^). (6.15) 

Hence, for m = 3, cyclicity always produces a rescaled version of the 
uniform iid case, with the rescahng factor given by 1 + 2it!e(72). 
For m — 4, however. 



and 



/ 1 




-1/3 


-1/3 - 


l/3\ 


-1/3 


1 


-1/3 - 


1/3 




/3 - 


-1/3 


1 


1/3 


V-1/3 - 


-1/3 


-1/3 


1 / 




0. Thus, 








/I 





-1 0\ 









1 


-1 






-1 





1 








-1 


l) 






/I 


-1 


1 -1\ 






-1 


1 


-1 1 






1 


-1 


1 -1 






v-1 


1 


-1 w 





In this case, we have 



16 16 16 

Next, note that 2M(2) + M^^^ = 3M(i). Then, if i?e(72) = 73, 

S = + ^i?e(72)M(2) + l73M(3) 

= ^^'^^'^ + ^(2i?e(72)M«) 
lo lo 

= ^(l + 2i?e(72))MW, (6.16) 

so that there is still a rescaled version of the iid case in a non-iid cychc setting. 
Indeed, since we know that A2 = ai + ia2 — 03 — ^04 = (oi — 03) + ^(02 — 04) 
and A3 = oi — 02 + 03 — 04, we find that 
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(a2 + 2a3 + a^y + (02 - a^y 

and 73 = 1/ (2(02 + 0.4)) — 1. A short calculation then shows that Re{pf2) = 73 
if and only if We thus have a complete characterization of all 

4-letter, cyclic Markov chains whose Young tableaux have the same limiting 
shape as the uniform iid case. In particular, choosing for some 

< a < 1/3, leads to 03 = a and oi = 1 — 3a. If, moreover, a = 1/4, we have 
again the iid uniform case. For a 7^ 1/4, however, we may view the Markov 
chain "lazy" version of the uniform iid case. 

Note that the scaling factor in both (16.151) and (16.161) is 1 + 2Re{j2)- The 
following theorem shows that, in fact, such a scaling factor occurs for general 
m, and gives a spectral characterization of all transition matrices which lead 
to an iid limiting shape. 

Theorem 6.3 Let P he the m x m transition matrix of an aperiodic, irre- 
ducible, cyclic Markov chain on an m-letter, ordered alphabet given in The- 
orem lKR Then the asymptotic covariance matrix is a rescaled version of 
the iid uniform covariance matrix Ejjd„ := ((m — l)/m'^)M^^^ if and only if 
the constants 7^ = Aj/(1 — for 2 < j < m, satisfy the condition 

Re{^j) = 7, for all 2 < j < m, (6-17) 
for some real constant 7. Moreover, the scaling is then given by 

S = (1 + 27)Siidu. (6.18) 
Proof. We first claim that the system of matrix equations 

m 

^6jM(^) = M(^) (6.19) 

has a unique solution bj = l/{m — 1), for all 2 < j < m. Indeed, revisiting 
(I6.14p . we can express each M^-'^ as 

= M'^^'> + M^"'-'+^\ (6.20) 
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where {M^^^)k,e = uj^^'^^^^'^^ /2, for all 1 < < m, so that fICT]) becomes 

m 

i=2 



i=2 



= J]6,M(^-), (6.21) 

i=2 

where 6j := (6j + 6m-j+i)/2, for 2 < j < m. 

Now, clearly, each M*^-'^ is cyclic, so that in solving fl6.2ip we need only 
examine the m entries in the first rows of the matrices. We can thus reduce 
fl6.2ip to the m x (m — 1) system of equations 



/ 1 


1 


1 




UJ 




oo' ■ 








■ 


. ^2(m-l) 




^2{m-l) 


j^3{m-l) . 





h 

\bmj 



/ 1 \ 

-1 

m— 1 
-1 



m— 1 



V m— 1 / 



(6.22) 



Since each of the last m — 1 rows of the matrix in (16.221) sums to —1, it is 
clear that bj = l/(m — 1) is a solution to the system. To see that this solution 
is, in fact, unique, consider the (m — 1) x (m — 1) sub-matrix consisting of 
the last m — 1 rows of the matrix in (16.221) , namely. 



UJ 



LU 



UJ 

UJ^ 



UJ 



2(m-l) 



m—l 



U) 



im-X)' 



(6.23) 



Now this matrix, which is very closely related to the Fourier matrix which 
arises in discrete Fourier transform problems, is in fact invertible, and can be 
shown to have one eigenvalue of —1, and m — 2 eigenvalues of the form ±^/m 
and ±i^/m, so that the modulus of the determinant is m*^™~^^/^ ^ 0. Thus, 



the solution bj = l/(m— 1) is unique, and since bj = {bj+bm-j+i)/2 = br, 
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for all 2 < j < m, we conclude that bj = l/(m — 1) as well, for all 2 < j < m, 
and the claim is proved. 

We can now use Theorem l6.2l to simplify the asymptotic covariance matrix 
decomposition as follows: 

m ^ — ^ 

fc=2 
fc=2 

= ^M(^) + 27^M(^) 
= (1 + 27)^M« 

= (1 + 27)S,,,„, (6.24) 

where 7 = i?e(7j), for all 2 < j < m. If the real parts of 7^ are not all 
identical, then the uniqueness of the solution of fl6.19l) implies that no such 
simplification is possible, and the theorem is proved. ■ 

Remark 6.2 To see that the condition in f l6.17p is not vacuous for any m, 
recall that form = 4, the "lazy" chain has the iid limiting shape. This is true 
for general m: if a2 = = ■ ■ ■ = = a, for some 0<a<l/(m — 1), then 
Xj = l — {m—l)a, for all2 <j<m. Trivially, then, 7^ = l/((m— l)a) — 1 : = 
7, for all 2 < j < m, so that the conditions of Theorem \6.'^ are satisfied, and 
the scaling factor is given by 1 + 2^ = (2 — (m — l)a)/((m — l)a). Even in 
the m = A case, however, we saw that there were other, more general, cyclic 
transition matrices which gave rise to the iid limiting distribution. 

The previous theorem indicates precisely when we may expect the limiting 
shape of a cyclic Markov chain to be identical to that of the iid uniform case. 
Now the first-order behavior of all rows of the Young tableau is n/m + 0{y/n) 
for cyclic Markov chains. Although this differs from the first-order behavior 
in the non-uniform iid case, one may still ask whether the limiting shape for 
a cyclic Markov chain might still be that of some non-uniform iid case. In 
fact, this can never occur: cyclicity ensures that the asymptotic covariance 
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matrix is also cyclic, and thus cannot be equal to the asymptotic co variance 
matrix of any non-uniform iid case. 

Still, we may ask how to relate the iid non-uniform limiting shape to 
that of a general Markov chain having the same stationary distribution. The 
following interpolation result describes the asymptotic covariance matrix for 
a Markov chain whose transition matrix is a convex combination of an iid 
(uniform or non-uniform) transition matrix and another arbitrary transition 
matrix having the same stationary distribution: 

Theorem 6.4 For any m > 3, let Pq be the m x m transition matrix of 
an irreducible, aperiodic, homogeneous Markov chain, and let its associated 
asymptotic covariance matrix be given by 

Eo = Ho + Uo{S,'DoSo) + {S.'DoSofUo, (6.25) 



in the standard notations of Theorem 4-1 Then, forO < 6 < 1, the transition 



matrix P = (1 — 5)/^ + 5Pq has an asymptotic covariance matrix given by 

S = i (So + (1 - 5)Sno) , (6.26) 

where Ehq is the covariance matrix associated with the iid Markov chain 
having the same stationary distribution as Pq. 

Proof. Using the standard notations of Theorem 14.11 we will write 

E = n + n{s-^DS) + {s-^DSfii 

in terms of the decomposition Sq in (16.251) . Now, clearly, the stationary 
distribution under P is that of Pq, so that 11 = Ho. We will thus write the 
stationary distribution simply as (tti, 7r2, . . . , vr^). Moreover, the eigenvectors 
are also unchanged, so that S = Sq. However, for each eigenvalue o of Po, 
we have that = (1 — 5) +6\kfl is an eigenvalue of P, for 1 < < m. Thus, 
for each 2 < k < m, the diagonal entries of D are given by 



Afc 

■~ 1-Afc 

^ (1-5) + 5\k,o 

5(l-A,,o) 

1-6 
= —J \- 1k,0, 
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where 7^,0 are the diagonal entries of Dq. We can thus decompose D as 
follows: 



D = diag(-l/2,72, . . . ,7„ 
= diag(-l/2,0,...,0)- 



1-6 



+ ( ^ ) diag(0,72,o, ...,7m,oj 



diag 



1-5 
26 



,0,...,0 



diag(0, 1, . . . , 1) 



(6.27) 



Next, recall from Proposition l4. ll that the first column of S* ^ is (1, 1, ... , 1)^. 
Hence, 



S-'DS = So'DSo 



(I * 







0\ / 



\i * ■■■ */ V 

\-6\ ^, 

-'m'^O 



TTi 7r2 



0/ V 



+ 



6 
( 



Sr, ImSn 



1-6 
26 



TTi 7^2 
TTi 712 



TTr, 



TTr, 



1-5 



* 



\ 



\tII TX2 ■ ■ ■ ■Km/ 



(6.28) 



which gives us 



liS-^DS = UqS-^DS 
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1-S 



/tti 

7r2 

: 

\0 ••• 

6' 



\ /tTi 772 ■ ■■ TTrvX 
Til 772 • • • TTr, 



+ |i^jno+Q)no5'o-^Do5'o 



25 



1X11X2 
7X1 



J 



no + Q I no^o-'^o-^o. 



yVTi 7r2 • • • TXmJ 



(6.29) 



Finally, we can express E as 



E = n + Y[{S-^DS) + (S^^L'^j^n 

= Ho + (^1 - Ho + no(5-^L'5) + {MS-'DS)f 



+ 1- 



TX2TXI 



7ri7r2 



\TXmT^l 7Xm'^2 

-Sno) 



1 



;So + (1 - 5)Eno) , 



7r27r„ 



(6.30) 



and we are done. ■ 

Thus far we have expressed our limiting laws in terms of Brownian func- 
tionals whose Brownian motions have a non-trivial covariance structure aris- 
ing directly from the specific nature of the transition matrix. It is of interest 
to instead express the limiting laws in terms of standard Brownian motions. 
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Since the asymptotic covariance matrix S is non-negative definite, we can 
find an m X m matrix C such that S = CC'^ . (The matrix C is not unique, 
since {CQ){CQ)'^ = CC^ = S for any orthogonal matrix Q.) Clearly, we 
then have 

ia,B\t),a2B\t), amB"'{t)f = C{B\t), B\t),..., B^{t)f, (6.31) 

where {B^(t), B^{t), . . . , B™'{t))^ is a standard, m-dimensional Brownian mo- 
tion, since 

¥.[{a,B\t), a,B\t), a^B^it)f (a^B^t), a^B^t), a„5-(t))] 
= E[C{B\t), B\t),..., B^^{t)f] [{CiB\t), B\t),..., B^^{t)ff 
= C [E{B\t), B'it),..., B-{t)f)iB\t), B\t), B-^it))] 

= ts. 

Next, we can, without loss of generality, assume that r(£) = i, for all i, 
and so write our main result (15.81) in Theorem 15.21 as 

-j^j. rUr r — TUr {dr+nir-r+j) 

max Yl 

^ k=l J=l 1=] 

■■= (6.32) 
Simply substituting fl6.3ip into fl6.32p immediately yields 



rrir I m 



k=l \i=l 

r—rrir (dr+nir-r+j) 

+ max J] J2 I 



^1 — m,r,dr ■ -, „ . \ ■ i 



■ 1 k=l 

m r- 

+ max Y,Y1 C^r+i,^BKAt^^,). (6.33) 



1=1 k=l 

m r—rur (dr+rrir-r+j) 



i=l j=l 
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Now the first term in fl6.33p is simply a Gaussian term whose variance 
can be computed exphcitly. Unfortunately, the maximal term does not in 
general succumb to any significant simplifications. However, in the iid case, 
we can further simplify (16.331) in a very satisfying way. 

Indeed, since, in the iid case, we have al = 7rfc(l — vTfc) and, for k ^ i, 
Ck,e = — TTfeTT^, one can quickly check that C can be chosen so that Ck^k = 
^/TTfc — a/^^A:; and, for k ^ i, Ck/ = — y^vr^. Moreover, for all rrir + 1 < 
k < rrir + dr, TTfe = TTrrir+i = ^r- Then, within the maximal term, Cmr+i,i = 
^Tiv — 'Kr\fTW-, for i = + and Cm^^i^i = —'Kr^fT^i-, for i ^ rrir + C-- With 
the convention that z/q = 0, we can then express (16.331) as 



i=l i=l k=l 

(■r-rrir (dr+rrir-r+j) 




-'■ j=l l=j 
rn r-rrir {dr+rrir-r+j) 




i=l j=l e=j 




1=1 i=l k=l 

fT—rrir [dr+mr-r+j) 





r-rtir {dr+rrir-r+j) 



+ ^/^Tr max 



J2 Yl B--^\At,,) 



j=i e=j 
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r—nir {dr+nir-r+j) 

+ 0F; max ^ J2 fi'"'-+^(Atj-^) 



rrir 

1=1 

m 

-{I'mr+'^rr) ^ y/¥iB\l)^ 
i=m,r+dr+l 
rrir+dr 

+ VT^r[-{Vm^ + T^rr) B\1) 

i=mr+l 
r-rrir {dr+nir-r+j) 

+ max E B^"'-^\M,,)]. (6.34) 

3=1 e=j 

Note that the first two Gaussian term of fl6.34p are independent of the 
remaining two Gaussian-maximal expression terms. 

Following Glynn and Whitt|14j and Barishnykovjl], who studied the 
Brownian functional 

m 

i=i 

we define the following, more general, Brownian functional: 

r (m-r+j) 

where 1 < r < m. Clearly, the maximal term in fl6.34p has just such a form. 
We also remark that -Dr,m corresponds to the sum of the r largest eigenvalues 
of an m X m GUE matrix. 

To better understand fl6.34p . we may, without much loss in generality, 
focus on the first block, that is, values of r such that rrir = 0. The first 
Gaussian term of fl6.34p thus vanishes, and, writing Umax for vr^, we have 



m 

i=di+i 
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di \ 

J2B\l) + Dr,dA. (6.36) 
i=i J 



In the uniform iid case, the first Gaussian term of fl6.36p itself vanishes, since 
dr = di = m, and we have 



yr 



^^'"^ (6.37) 



m 



For r = 1, this resuh corresponds to Theorem 4.1 of the authors' previous 
paper [18]. Furthermore, speciahzing fl6.36p to r = 1, 



T J — 



—F= -T^max > , V^i^ V-) 

Jn ^-^ 

/ 

5^ fi^(l) 



+ \/T^max i-^ T^max \ ^ B\l) 

+ \/T^maxii\A^- (6-38) 



One can easily compute the variance of the Gaussian terms in (16.381) to be 
T^max{X — d\-n max) I d\^ which is consistent with Proposition 4.1 of the authors' 
previous paper [18j . 

The iid development above suggests that we can find additional cases 
which yield simple functionals of standard Brownian motions. Indeed, the 
first property of the matrix C in the iid case that allowed the functionals 
to be simplified was that C^/ = Q, for all k ^ i,mr + 1 < A; < rrir + dr, 
and 1 < i < m, where Ci,C2, ■ ■ ■ ,Cm were real numbers. Then, writing the 
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diagonal terms of C as Ck^k = bk + Ck, for rrir + 1 < ^ < ""^r + d^., we may 
revisit fl6.33p . and write 

m rrir rn r—rrir (dr+mr—r+j) 

C = E E Ck,B\l) + max E E E C„,^-^i,B\/\t,,,) 

■ 1 I 1 J-r-mr,dr „ . 

1=1 k=l 1=1 ]=1 £=j 

m rrir y-r—rrir (dr+rrir-r+j) 

= EE^'^'^^'(1)+ max i E E &m.+^5'"'-+'(At„,) 

i = l k = l Ir-^r.dr L ^-^^ 

m r-rrir (dr+nir-r+j) 

+ E E E <^M^h^ 
i=i j=i i=j 

m rrir rn 

= EEc'Mi^^(i)+^E^^^^(i) 

1=1 k=l i=l 

r-rrir (dr+rrir-r+j) 

+ max E E &m.+^5™'-+^(At,-,) (6.39) 

Except for the fact that we have written the functional in terms of stan- 
dard Brownian motions, the maximal term in fl6.39p is no simpler than that 
of our original functional. However, the second property of the iid case that 
yielded further simplifications was that bk = b, for all rrir + ^ k < rrir + dr- 
In this case, fl6.39p becomes 



^- = EE^^.^^^w+^E^^^^(i) 

i=l k=l i=l 

r-rrir {dr+rrir-r+j) 

+ 6 max E E B"'^+\Atj^e) (6.40) 

Ir-,^r,dr .^^ 

Again, by focusing on the first block, we no longer have the initial Gaus- 
sian term, and (16.400 becomes 



i=di+i 
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di r (di-r+j) 

i=di + l \ i=l 

m d 

r 



=di+i 1=1 ^ ^' 

QS'(l)+r^(Q + -)5^(l) + 6i/,,,,. (6.41) 



m 

r 



We restate these results in the following theorem: 

Theorem 6.5 Assume, without loss of generality, that T{i) = i, for all 
1 < i < m, in the notations of Theorem \5.2i Moreover, let the asymptotic 
covariance matrix be given by = CC"^ , where C is an m^m matrix whose 
first di rows are given by 



Ck^i = ce, k =^ i,l < k < di,l < i <m 

Ck,k = b + Ck, I <k < di, 

for some real constants Ci,C2, ■ ■ ■ ,Cm and b. Then, for 1 < r < di. 



(6.42) 



m '^1 / J, 

V;; = r ^ c,B\l)+rJ2U + -]B\l) + bHr,,„ (6.43) 

i=di+l i=l ^ ^ 

where H^^di is the maximal functional 



r,d\ 



I dx r (di-r+j) 

\ «=i .j=i t=i 
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Remark 6.3 One can generalize Theorem \6.5\ to non-initial blocks (i.e., to 
r > di) by extending the conditions in f l6.42l) to non-initial blocks and then 
applying the theorem to — . 



To better understand which asymptotic covariance matrices S can be 
decomposed in this manner, the conditions Ck^i = Q, for all k i,l < k < 
di,l < i < m, and 6^ = b, for all 1 < k < di, imply that 



al = b' + 2bck + J2(^l (6-44) 



for 1 < A; < (ii, and 



1=1 



m 

<^k,e = bck + 6q + ^ c^, (6.45) 



i=l 

ioT 1 < k < e < di. 

If we let {Zi, Z2, . . . , Zm) be a centered Gaussian random vector with 
covariance matrix S, then (16.441) and (I6.45P give us 



= 2b\ (6.46) 

for all 1 < k < i < di. That is, the L^-distance between any pair {Zk, Zi) is 
the same, for 1 < k < i < di. 

Notice that if al = a"^, for all 1 < k < di, then in fact (16.461) implies that 
Pk,e = o'k//crk<^e = 1 ^ fe^/o"^ := p, for all 1 < k < i < di. That is, the di x di 
submatrix of S must be permutation-symmetric. 

Next, we note that, for 1 < k < i < di, 

al-al = 2b{ck-c,), (6.47) 

so that Ck = all {2b) + cq, for some constant Cq. Substituting this expression 
into (I6.44P and, writing V = J2^di+i obtain 
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= b' + al + 2bco + f2(^ + ^o) +r. 

i=i ^ ^ 

Writing cr^ = (^fii al)/di, for any r > 0, (16.481) gives us 



(6.48) 



b' + 2bco + f2(^ + CoY + T 

1=1 ^ ^ 



6 / I 462 
0. (6.49) 



In order for cq to be a real number, the discriminant of the quadratic 
equation in (I6.49P must satisfy 



26+^^)'-4rfJ6^ + ^ + rl>0, (6.50) 



which leads to the inequality 



(di - l)b^ - di{a^ -V) + '^[a^- (^2) j < 0. (6.51) 

This inequality, in turn, gives us constraints on b^. Indeed, the necessary 
and sufficient condition needed for such a b"^ to exist is given by examining 
the quadratic in b'^ in fl6.5ip at its extremal point, namely, at b'^ = di{a'^ — 
r))/{2{di — 1)). Doing so leads to the condition 



d'M'-rr \ , dl f- (- 

4((ii - 1) 



(^(t2)'^ < 0, (6.52) 



or simply. 
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since > (^(^^^ ■ The closer that T is to a^, the more similar that the 
di variances must be. Thus, (16.531) functions as a bound on the variability 
among these di variances. Provided that the variances satisfy (16.53^ . the 
condition on &^ is given by 



di 



2{d, - 1) 



[a^ - r) + - r)2 - (rfi - 1) (^a^ - [a^) 



(6.54) 

Now consider the doubly stochastic case, where di = m. Applying the 
general fact that each row of S must necessarily sum to zero, we use (16.441) 
and (I6.45P to find that, for each 1 < k < m, 

m 

^(^k,i = o-l + ^ak,i 

e=i n+k 



1=1 / li^k \ %=\ 



m 

2 



6^ + 6 j mck + Q 1 + m c; 



1=1 



= 0, (6.55) 
so that Cfc = c G M, for all 1 < A; < m. Substituting c back into (I6.55P gives 



us 



ak/ = 6^ + b{mc + mc) + m{m(?) 

1=1 

= (6 + mc)2 = 0, (6.56) 
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so that b = —mc. This then imphes that al = m{m — l)c^ and cr^^^ = — mc^, 
for a\\ 1 < k < m, i ^ k. But this is precisely a permutation-symmetric 
covariance matrix, which in the iid case corresponds to the class of Markov 
chains having a uniform stationary distribution. 
We summarize these results in the following: 

Theorem 6.6 In order that the asymptotic covariance matrix S have a de- 
composition E = CC'^ , where 

Ck,e = Q, k^i, l<k<di, l<i<m, 

Ck,k = b + Ck, l<k<di, 

for some real constants ci, 02, ... ,Cm and b, it is necessary and sufficient that 



/ \2 10-2 -ri 

^4 _ f^2\ < 1 ^ (6.58) 



VdT^ 

where T = Xll^di+i ^^'^ = (Sfii '^D/^i, for any r > 0. In this case, 



di 



2(cii - 1: 



(6.59) 



In particular, ifdi = m, the asymptotic covariance matrix must he permutation- 
symmetric, with Ck = c, for all k, and b = —mc, so that the common variance 
is m{m — l)c^ and the common covariances are all —mc^. 



7 Connections to Random Matrix Theory 

For iid uniform m-letter alphabets, the limiting law of the Young tableau 
corresponds to the joint distribution of the eigenvalues of an m x m matrix 
from the traceless GUE |22]. In the non- uniform iid case, we further noted 
that Its, Tracy, and Widom [20l [2T] have essentially described the limiting 
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shape as that of the joint distribution of the eigenvalues of a random matrix 
consisting of independent diagonal blocks, each of which is a matrix from 
the GUE. The size of each block depends upon the multiplicity of the corre- 
sponding stationary probability. In addition, there is a zero-trace condition 
involving the stationary probabilities on the composite matrix. 

As a first step in extending these connections between Brownian func- 
tionals and spectra of random matrices, recall the general case when the 
stationary probabilities are all distinct (see Remark 15. 5p . Our Brownian 
functionals then have no true maximal terms, so that the limiting shape, 
{Rla, R^ooi ■ ■ ■ 1 -^S) is simply multivariate normal, with covariance matrix E 
(or, more precisely, the matrix obtained by permuting the rows and columns 
of E using r, the permutation of {1,2,..., m} previously defined). Trivially, 
this limiting law corresponds to the spectrum of a diagonal matrix whose 
elements are multivariate normal with the same covariance matrix E. 

We can see that this general result is consistent with the non-uniform 
iid case having distinct probabilities. Indeed, each block is of size 1, and 
is rescaled so that the variance is T^r(i){)- — T^T{i))^ for 1 < i < m. Because 
of this rescaling, instead of having a generalized zero-trace condition, as in 
the non- rescaled matrices used in [201 IS], our condition is rather a true 
zero-trace condition. This zero-trace condition is clear, since the covari- 
ance matrix for any iid case (uniform and non-uniform alike) is that of a 
multinomial distribution with parameters (n = 1; t^t{i), '^t{2), ■ ■ ■ , T^rim)), and 
any {Yi,Y2, . . . , Ym) having such a distribution of course satisfies ^™ Yi = 
1, so that Var{Yy^iYi) = 0, which implies the zero-trace condition for 



Next, consider the case when each stationary probability has multiplicity 
no greater than 2. We conjecture that the limiting shape (-R;!^, R^, ■ ■ ■ , R^) 
is that of the spectrum of a direct sum of certain 1x1 and/or 2x2 random 
matrices. Specifically, let k < m be the number of distinct probabilities 
among the stationary distributions. Then the composite matrix consists of a 
direct sum of k, GUE matrices which are as follows. First, the overall diagonal 
{Xi,X2, . . . , Xm) of the matrix has a A^(0, E) distribution. Next, if dr = 1, 
then the GUE matrix is simply the 1x1 matrix (X^). Finally, if dr = 2, 
then the GUE matrix is the 2x2 matrix 



whose off-diagonal random variables ^^,.+1 and ^rrt,.+i are iid, centered, nor- 
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mal random variables, independent of all other random variables in the over- 
all matrix, with variance 



If such a conjecture were true, it would imply the following, more modest 
marginal result regarding a single block of such a matrix, which without loss 
of generality we take to be the first block. Specifically, if c^i = 2 and r(r) = r, 
for all 1 < r < m, we claim that = {V^,V^ - V^) is distributed 

as the spectrum (Ai, A2) of the 2x2 GUE matrix 

where Ai > A2. Equivalently, we will show that (y^,V^) is distributed as 
(Ai,Ai + A2). 

Let the 2x2 submatrix S2 of S be written as 



/ af pcricr2\ 
\pcr1a2 al ) ■ 



(7.2) 



Then 



^^2^2(1) + max (ai5i(t) - a2B\t)), 

a^B\l) + d,B\l)) . (7.3) 



We simplify (17.31) . by introducing new Brownian motions and then de- 
composing the resulting expression into two independent parts. To do so, 
begin by defining the new variances and correlation coefficients af := 5"|, 
cr| := af — 2paia2 + c^l, and p := {pai — (T2)/ a/ct^ — 2pai(X2 + 5-|. Then it is 
easily verified that B^{t) := B'^{t), and B'^{t) := {aiB^{t) - a2B'^{t)) / (T2 are 
(dependent) standard Brownian motions, and (17. 3p becomes 
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{VL Vl) = {a,B\l) + a, max B^{t), 2a,B\l) + ^^^^(l)) 

= UaiB\l) - pa^B^il)) + a^lp— + ma.^ B\t) 
\ \ ao o<i<i 



0-2 

2{a^B\l) - pa^B\l)) + (^2 + 2p(j^)B\l)] . (7.4) 



Note that B^{t) — pB'^it) is independent of -B^(t) and has variance (7^(1 — p^). 
Introducing the Brownian functional 

[/(/?)= 05^(1) + max (7.5) 

/3 G M, and using o"^, (Ta, and p above, (17. 4p becomes 



{VL. Vl) = a,y^rr^Z{l, 2)+[a,U[i;-p-],{a2 + 2pa^)B\l\ 



1 
2 



+ Waf-2paia2 + ff| f/ 



2a/ af - 2paia2 + 0-3, 
2(a^ - a2^)i?2(l)Y (7.6) 



where Z is a standard normal random variable independent of the sigma-field 
generated by B'^. 

Turning now to the eigenvalues' distributions, we first consider the cen- 
tered, multivariate normal random variables {Wi, W2), having covariance ma- 
trix 

aj po-io-2 

and let W3 and W4 be two iid, centered, normal random variables, indepen- 
dent of (14^1, 14^2), with variance cr|. Then it is classical that 



W2, sJWl + Wi + Wl) ^ a2(5(l),2 max 5(t) - 5(1)), 
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or, equivalently, 



(W2, PW2 + l^Jwi + Wi + ^a2{B{l),um, (7.7) 

where 5 is a standard Brownian motion, and U{(3), /3 G M, is defined in 

terms of B, ratlier tlian in terms of B^, as in (17.51) . Tlien consider tlie 
random variable 



= (w, - p^^ + (^p^ + ^Jwi + Wi + Wi^ . (7.8) 

Using (17.71) . and noting tliat tlie variance of tlie first term in (17.81) is af{l—p^), 
it is easy to see tliat 

A ^ aiVr^Z + 2a2f/(^), (7.9) 

wliere Z is a standard normal random variable independent of B. 

We now apply this result to the eigenvalues of the matrix Ai in (17. ip . 
namely, to 

. (^L±^) , + (7.10) 

and 

Letting Wi = (Xi + X2)/2, W2 = (Xi - X2)/2, W3 = Yi, and W4 = ^1, we 
have 



75 



(Ai, Ai + A2) = (Wi + ^Wi + Wi + Wi, 2Wi 

2(wi-p^W2] +2p^W2 

V (72/(72 

Wi-p^W2]{l,2) 



+ [P^W2 + 2 V ^2 + Wi + Wi, 2pfW2j , 

(7.12) 

where af = (crf+2pcri5-2+cr|)/4, (t| = (ai— 2p5-ia2+cr|)/4, and po-^o-f = (erf— 
cr|) /4 . Noting that the variance of W^i - (pai /(3-2 ) 14 /^2 is o-f(l-p^) = a f ( 1 - p^ ) , 
and that, moreover, /? := pai/2a2 = (af— a|)/(2Y/^f^^"2^55^^^~+5f), we find 
that 



(Ai,Ai + A2) = aix/r^Z(l,2) + (2a2t/(gi),2pgE2(l)) 

= (JiVl-p'^(l, 2) + (72(C/(/3), 4/35^(1)) 
= (7.13) 
and we have our identity in law. 

To illustrate the ways in which random matrix interpretations might po- 
tentially illuminate other, apparently unrelated, Brownian functionals, con- 
sider the following example. Let {ek)k>i be a sequence of positive numbers 
decreasing to zero. Then it is possible to find an increasing sequence of inte- 
gers {mk)k>i so that, for each k, there is a Markov chain on ruk letters such 
that: 

• the maximal stationary probability iTmaxik) is of multiplicity 3, and 

• the 3x3 covariance submatrix '^^{k) governing the associated Brow- 
nian functional V^{k) is of the form 
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'el 



Esik) = a{ky 10. (7.14) 



el 



That is, the variance of 5^*^^-* becomes arbitrarily large in comparison to that 
of and B^^^\ 

Then, since LIn{k) = Vjl{k), we have, as n ^ oo. 



^ max} ar(i)B^'^'>{Ati) 



n h.s 
a(A;)max(£fe(5^«(ti) -5^«(0)) + {B^^^\t2) - B^'^^\h)) 

+ {e,{B^(^){l)-B^^'\h))) 
V^ik), (7.15) 



so that, as k 



viik) 



max (5(t2) -5(ti)), (7.16) 



a{k) o<ti<t2<i 

where B{t) is a standard Brownian motion. The right-hand side of (17.161) is 
known as the local score, and describes the largest positive increase that B 
makes within the unit interval. Such functionals are of great importance in 
sequence comparison, particularly in bioinformatics {e.g., see Daudin, Etti- 
enne, and Vallois [10].) Moreover, 

max (i?(t2) - Bit,)) ^ max \B{t)\, (7.17) 

which follows immediately from the classical equality in law, due to Levy, 
(|-B(t)|)j>o = (maxo<s<i -B(s) — B{t))t>o- Thus, if we have a random matrix 
connection to V^{k), we can extend it to maxo<4<i |-B(t)|, at least in some 
limiting sense. This is also interesting from the following point of view. 

Classically, the Brownian functional maxo<j<i -B(t) = |-B(1)|, and a trivial 
random matrix connection can be seen by examining the eigenvalues of the 
random matrix 
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(7.18) 



where Z is a standard normal random variable. Then, clearly, Xmax has 
the half- normal law, since Xmax = niax(Z, —Z) = \Z\. Thus, the functional 
maxo<t<i B{t) has a random matrix interpretation, one which is considerably 
simpler than any potential random matrix interpretation for maxo<(<i 

8 Concluding Remarks 

In this paper, we have obtained the limiting shape of Young tableaux gen- 
erated by an aperiodic, irreducible, homogeneous Markov chain on a finite 
state alphabet. The following remarks indicate natural directions in which 
our results in some cases can, and in other cases, may hope to, be extended. 

• Our limiting theorems have all been proved assuming that the initial dis- 
tribution is the stationary one. However, such results as Theorem 2 of Der- 
riennic and Lin [TT] allow to extend our framework to initial distributions 
started at a specified state. Indeed, in this case, i.e., if for some k = 1, . . . , m, 
P(Xo = ak) = 1, the asymptotic covariance matrix is still given by fl4.12p . 
and, for example. Theorem 15.21 remains valid. For an arbitrary initial dis- 
tribution, what is needed in this non-stationary context is an invariance 
principle. More generally, our results continue to hold for fc*'^-order Markov 
chains, and in fact, they extend to any sequence for which both an asymp- 
totic covariance matrix and an invariance principle exist. 

• Our limiting theorems have only been proved for finite alphabets. How- 
ever, from the authors' previous work [12], it is known that for countably 
infinite iid alphabets, LJ„ has a limiting law corresponding to that of a non- 
uniform, finite-alphabet. Hence, for a countably infinite-alphabet Markov 
chain (subject to additional constraints such as Harris recurrence?), we might 
still be able to obtain limiting laws of the form developed in this paper. 

• By using appropriate existing concentration inequalities, one can expect 
to establish the convergence of the moments of the rows of the tableaux. 
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• One field in which the connection between Brownian functionals and ran- 
dom matrix theory has been exploited is in Queuing Theory. The develop- 
ment below, following O'Connell and Yor [25], shows how Brownian function- 
als of the sort we have studied arise as generalizations of standard queuing 
models. 

Let A{s,t] and 5(5,^], — oo < s < t < oo, be two independent Poisson 
point process on M, with intensity measures A and /x, respectively, with < 
X < fi. Here A represents the arrivals process, and S the service time process, 
at a queue consisting of a single server. The condition A < /i ensures that 
the queue length 

Q(t)= sup {A{s,t\-S{s,t\}, (8.1) 

— CXD<S<t 

is a.s. finite, for any t G M. Then, defining the departure process 

D{s,t] = A{s,t]-{Q{t)~Q{s)), (8.2) 

which is simply the number of arrivals during (s, t] less the change in the 
queue length during (s,t], the classical problem is to determine the distribu- 
tion of D{s,t\. The answer to this problem is given by Burke's Theorem |8] 
(see Theorem 1 of [25j): 

Theorem 8.1 D is a Poisson process with intensity \, and {D{s^t],s < t} 
is independent of {Q{s), s > t}. 

That is, D has the same law as the arrivals process A. Moreover, since,the 
queue length after time t is independent of the process D up to time t, one 
may take the departures from the first queue and use them as inputs to a 
second queue, and observe that the departure process from the second queue 
also has the law of A. Proceeding in this way, one generalizes to a tandem 
queue of n servers, each taking the departures from the previous queue as its 
arrivals process. 

One can further generalize this model to a Brownian queue in tandem 
in the following manner. Let B, B^, B^, . . . , B"- be independent, standard 
Brownian motions on M, and write B^{s,t) = B^{t) — B''{s), for each k and 
s < t, and similarly for B. Let m > be a constant, and define, in complete 
analogy to and (Q, 

qi{t) = sup {B{s,t) +B\s,t) -m{t- s)} , (8.3) 

— oo<s<t 
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and, for s < t, 

d,{s,t)=B{s,t)-{q,{t)-q,{s)). (8.4) 
For A; = 2, 3, . . . , n, let 

qkit)= sup {4-i(s,t) +5^'(s,t) -m(t-s)}, (8.5) 

— oo<s<t 

and, for s < t, 

dk{s, t) = 4-i(s, t) - (gfc(t) - qk{s)). (8.6) 

Here B is the arrivals process for the first queue, dk-i is the arrivals 
process for the k^^ queue (A; > 2), and mt — B^(t) is the service process for 
the k^^ queue, for all k. Using the ideas employed in Burke's Theorem, it 
can be shown that the generalized queue lengths gi(0), ^2(0), . . . , 5'n(0) are iid 
random variables. Moreover, they are exponentially distributed with mean 
1/m. 

Using the definitions in fl8.3p - fl8.6l) . and a simple inductive argument, one 
finds that 



n 




K(t)= sup {5i(-t,-s„_i) + --- + 5"(-si,0)}. (8.8) 

0<si<-- 

<Sm-l<t 

By Brownian rescaling, we observe that 



Ln{t) = Vt sup {5i(-l,-s„_i) + --- + 5"(-si,0)} 

0<si<-- 

<Sm-l<l 

= (8.9) 

where the functional is as in Theorem 15. 21 with associated rixn covariance 
matrix E = tin and parameter set Thus, L„(t) may be thought of as a 
process version of this V^. 
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The generalized Brownian queues in fl8.3p - fl8.6l) involved independentBiow- 
nian motions. These can be extended with Brownian motions B^, . . . , 5" for 
which {aiB^[t), . . . anB"'[t)) has (nontrivial) covariance matrix tE. Whether 
or not we keep the initial arrival process B{t) independent of [B^,. . . ,B^), we 
now no longer have that gi(0), ^2(0), . . . , 5'n(0) are iid random variables, due 
to the dependence among the service times mt — B^{t), but we do still have 
the identity (18.71) and fl8.9p relating the total occupancy of the queue at time 
zero to V^. More importantly, our generalizations of the Brownian function- 
al Lnit) above can be used to describe the joint law of the input/output of 
each queue. 

• An important topic connecting much of random matrix theory to other 
problems, such as the shape of random Young tableaux, is the field of orthog- 
onal polynomials. (See, e.g., [22].) It would be of great interest to see what, 
if any, classes of orthogonal polynomials are associated with the present pa- 
per. 
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